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HERPES VIRUS ENTRY RECEPTOR P 



1 Field of the Invention 



^^^^ The field of this mvention is a herpes virus entry receptor 
(Tn^ir^F), More particularly, the field of the present invention is 
recombinant mammalian IIVER, polynucleotides encoding that HV^, 
and methods of making recombinant lIVER . 

Background of the Invention 
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Glycosaminoglycan chains on cell surface proteoglycans 
serve as receptors for the binding of herpes simplex virus types 1 and 2 
(HSV-1 and HSV-2) to cells. Binding is not sufficient for entry, however: 
otiier cell surface components are necessary for virus entiy, which occurs 
by fusion of the virion envelope with a cell membrane. For example, 
Chinese hamster ovaiy (CHO) cells ejq^ress glycosaminogtycan chains to 
which HSV-1 and HSV-2 can bind, but are resistant to the entry of some 
HSV strains, particularfy HSV-l(KOS). 

The present invention is directed to a newly discovered 
protein that enables herpes simplex virus (HSV) to penetrate into cells 
and is a previously undiscovered member of the family of receptors 
designated the tumor necrosis factor receptor/nerve growth factor 
receptor (TNFR/NGFR) family. Members of this family have 
characteristic repeats of amino add sequence containing multiple 
cysteines and serve as receptors for a variety of specific ligands, including 
but not limited to cytolsm^ The protein is designated herpes virus entry 
receptor protein or ^gR^KR. 
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By identifying the gene that encodesfev=^^by showing 
W that can mediate the entiy of HSV into celk and by peifonning 

experiments to define viral and ceU factors that influence the ability of 
0 to mediate HSV entiy. the inventors have provided the 

5 knowledge and biological material required (i) to develop antiviral drugs 

that can act to block HSV (and perhaps other herpesvirus) entry into 
cells; (ii) to identify other members of flie TNFR^^GFR family (or otiier 
cefl surface molecules) tiiat can serve as receptors for HSV-1, HSV-2 or 
otiier herpesviruses; (iii) to identify tiie nafairal ligand for tiie receptor. 
10 and (iv) to develop flierapeutic approaches for enhancing or inhibiting 
action of die Ugand on the receptor, depending on tiie patiiologic or 
beneficial consequences of tiiis action. 
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UnVf Sum mary of the Invention 

In one aspect, flie present invention provides an isolated 
and purified polynucleotide comprising a nucleotide sequence consisting 
_ essential^ of tiie nucleotide of SEQ ID N(3:l from about nucleotide 

positionSs to about nucleotide position 1189; (b) sequences tiiat are 
0 20 complementary to tiie sequences of (a), and (c) sequences tiiat. when 

expressed, encode a polypeptide encoded by a sequence of (a). A 
preferred polynucleotide is a DNA molecule. In anotiier embodiment, 
tiie polynucleotide is an RNA molecule. A preferred polynucleotide is 
SEQ ID NO:l. 

25 

In anotiier embodiment, a DNA molecule of tiie present 
invention is contained in an expression vector. The expression vector 

preferably furtiier comprises an enhancer-promoter operatively linked to 
tiie polynucleotide. In an especially preferred embodiment, tiie DNA 
30 molecule has tiie nucleotide sequence of SEQ ID NO^om about 

T) nucleotide position's^ to about nucleotide position^llb. 
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In another aspect, the present invention provides an 
oUgonucleotide of from about 15 to about 50 nucleotides containing a 
nudeotide sequence of at least 15 nucleotides that is identical or 
complementary to a contiguous sequence of a polynucleotide of this 
invention. A preferred oligonucleotide is an antisense oligonucleotide 
that is complementary to a portion of the polynucleotide of SEQ ID 
NO:i. 

The present invention also provides a pharmaceutical 
composition comprising a polypeptide or an antisense oligonucleotide of 
this invention and a physiologically acceptable diluent 

In another aspect, the present invention provides^^HV^ 
polypeptide of mammalian origin. In one embodhnent. that HVEk is an 
^ 15 isolated and purified polypeptide of about 300 amino add residues and 

comprises the amino add residue sequence of SEQ ID NO:2. More 
^ ^ferably. an^» of the present invention is a recombinant human 

20 In another aspect, the present invention provides a process 

V of making,^ comprising transforming a host ceU with an expression 

^ vector that comprises a polynucleotide of the present invention. . 

maintainingthetransfonnedcenforaperiodoftimesuffidentfor 
C\ expression of thej^ and recovering the HVER. Preferably, the host 

^ 25 ceU is an eukaiyotic host cell such as a mammalian cell,or a bactenal 

cell. Ail espedany preferred host fea manunalian ovarian cell. The 
D present invention also provid^^nfR^ made by a process of this 

^ invention. A preferred such^HVER is recombmant human HVER. 

30 The present invention still further provides for a host ceU 

transfonned with a polynucleotide or expression vector of this invention. 
Preferably, the host ceU is a mammalian cell such as an ovanan cell. 



Brief Description of the E>rawings 

In the drawings, which form a portion of the specification: 

FIG. 1 shows a map of the plasmid (pBEC580) cloned from 
the cDNA library on the basis of its ability to convert resistant CHO-Kl 
cells to susceptibility to HSV-l(KOS) infection. The cDNA insert 
prepared as described in the text was ligated between BstXI and Not I 
sites in the polylinker region of pcDNAI (shown in the inset). 

5IG. 2 shows the nucleotide sequence (SEQ ID NO:l) of 
the cDNA inser^t^BECSSO and amino acid sequence (SEQ ID NO:2) 
translated from the^^a^eading frame designated HVER. Predicted 
features of the HVER pb^fe^tide include a signal sequence (dotted 
underline), two potential sit^s^^to^'^ addition of N-linked glycans (bold 
underline), a hydrophobic regiM^atfeo^ld potentially span a membrane 
(underline) and three cysteine-rich rfep^ate ch^acteristic of membere of 
the TNF/NGF receptor family (shaded b^^). ^x,^ . ^/ . 

FIG. 3 shows a map of the plasmid (pBEC^) produced by 
transferring the cDNA insert of pBEC580 to the vector, pcDNA3 (shown 
in the inset). The cDNA insert was excised from pBEC580 by cutting 
with Hindm and Xhol and was ligated to pcDNA3 that had also been 
cut with Hindm and Xhol. The position of the cytomegalovirus 
promoter (P-CMV) is shown and also the position of the selectable 
marker Neo, along with upstream and downstream sequences required 
for its expression in eukaryotic cells. 

^ m?j>4^^ws susceptibility of HeLa cells and various CHO 

cell lines to infection ^1HSV-l(KOS). The vahies reported are the 
optical density at 410 nm, Eacfe^pQint represents the mean of triplicate 
(panel A) or quadruplicate (panel B)5feteminations. The individual 
values were within 10% of the mean. A. HetSrc^ (open circles) and_ 



CM(5^H~£B«sidosed circles). B. CHO cell lines stably transfected with 
pBEClO, wW^^i^s^i^HVER cDNA [CHO-A3 (closed triangles); / 
CHO-A12 (open sq«ares)rcilQ^(open triangles); CHO-B9 (closed 
squares); CHO-Bll (open circles)^;M^^trol cell line stably 
transfected with the vector pcDNA3 [CHO^e^osed circles)]. 

FIG. 5 shows repUcation of three HSV strains in CHO cells 
stably transfected with^.^^ and in control CHO cells and HeLa cells. 
Cells plated in 6-weU plates at about 5 x 10* cells per weH were 
inoculated with the virus indicated at Vf PFU per weU, to ensure that all 
susceptible cells were synchronously infected. After aUowing 2 hr for 
virus binding and entry, the cells were washed and treated with citrate 
buffer. pH 3, to inactivate input vims that bound to cells but faUed to 
initiatliiifeclion. Culture medhim was added and one set of cultures 
harvested immediately (2 hr after addition of the virus inoculum) for 
quantitation of infectious virus by plaque assay on Vero cells, to 
determine the baseline viral titer prior to the appearance of progeny 
virus (black bars). The remainder of the cultures were harvested at 31 hr 
for quantitation of viral progeny (diagonal-hatched bars). The vahies 
presented represent half the yield from each culture. 

^^G. 6 shows CHO-A12 cells that express HSV-1 or HSV-2 
gP are resistantVHSV.l(KOS) infection. Hie results shown are for the 
amount of plasmid ^KA giving maximal interference (1.5 /tg per well for 
the gD-l-expressing pla^and 2.0 per weU for the gD-2-expressing 
plasmid). n»e control plas^^was used at 1.5 per well and the CHO- 
Kl cells were not transfected. tli^es given are the means of 
quadruplicate determinations. 

mo 7 shows a map of the plasmid (pBL58) expressing the 

VlOljiA/vFc 

^p\^l-ig-hybrid protein. 



^^**^^Gs. 8A and 8B show the nucleotide sequence (SEQ ID 
NO:6) of pBL^d the amino acid sequence (SEQ ID NO:7) of the 
open reading fra^ncodingHVER-Ig. Features of the HVER 
ectodomain that w^escribed in FIG. 2 are shown here along with the 

site at which the HvV^««<=« ^ ^".^ *^ ^'^'^ 
sequence (the boxed reW^^ three amino acids inserted at the fusion 
site due to the EcoRI linlter ^ed). The two potential sites for the 
addition of N-linked glyca,^ are underlined along with a third 

site in the IgG sequence. \ 

FIG^shows a schematic drawing of human^lfVER. The 
protein has characteristics of a typical type I membrane glycoprotein, 
including an N-terminal signal sequence (diagonal-hatched box) and a 
membrane-spanning region (cross-hatched box). The protein also has the 
cysteine-rich repeats characteristic of the TWR/NGFR family of ceU 
surface receptors. Each repeat has 4-6 cysteine residues (represented by 
vertical lines). 

to 

FIG. >»'shows the relative susceptabilities of A12 cells 
transfected with various gD-expressing plasmids. 

p^«i.nf H Descriptiofi »f the Invention 

J The Invention 

The present invention provides isolated and purified 
polynucleotides that encode^HVHfl of mammalian origm. egression 
vectors containing those polynucleotides, host ^gtoansformed with 
those egression vectors, a process of making^HV^ 
polynucleotides and vectors, and isolated and purified HVER. 

H. ^^4VER Polynucleotides 
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In one aspect, the present inve^on provides an isolated 
>olyni 

mammalian origin. 



and purified polynucleotide that encodes an^H\^BR polypeptide of 



Uv)£>lU ^ polynucleotide of the present invention that encodes 

^ HVCR is an isolated and purified polynucleotide that comprises a 

nucleotide sequence consisting essentially of the jpni^cleotide sequence of 

SEQ ID NO:l from about nucleotide position 295 to about nucleotide 

uuz- A. 
position 1189 of SEQ ED NO:l, (b) sequences that are complementary to 
\, 

the sequences of (a), and (c) sequences that, when expressed, encode a 
polypeptide encoded by the sequences of (a). A preferred pofynucleotide 
is a DNA molecule. In another embodiment, the polynucleotide is an 
RNA molecule. 



nucleotide sequence and deduced amino acid residue 
sequence of humj^ftV^ are set forth in FIG. 2. The nucleotide 
sequence of SEQ m^O:l in FIG. 2 is a full length DNA clone of 
human HVER. SEQ I]LKNO:2 in FIG. 2 is the deduced amino acid 
residue sequence of that cIoti 

The present invention also contemplates DNA sequences 
which hybridize under stringent hybridization conditions to the DNA 
sequences set forth above. Stringent hybridization conditions are well 
known in the art and define a degree of sequence identity greater than 
about 70%-80%. The present invention also contemplates naturally 
occurring allelic variations and mutations of the DNA sequences set forth 
above so long as those variations and mutations code, on expression, for 
an HVER of this invention as set forth hereinafter. 



As set forth above, SEQ ED NO:l, is a full length cDNA 
clone of human ^^VER. As is well known in the art, because of the 
degeneracy of the genetic code, there are numerous other DNA and 
RNA molecules that can code for the same polypeptide as those encoded 



by SEQ ID NO:l. The present invention, therefore, contemplates those 
other DNA and RNA molecules which, on expression, encode for the 
polypeptide encoded to, SEQ ID NO:l. Having identified the amino acid 
residue sequence oyiV^. and with knowledge of aU triplet codons for 
each particular amino acid residue, it is possible to describe all such 
encoding RNA and DNA sequences. DNA and RNA molecules otfier 
than tiiose specifically disclosed herein and, which molecules are 
characterized simply by a change in a codon for a particular amino acid 
are within the scope of this invention. 
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A Table of codons representing particular amino acids is 
set forth below in Table 1. 

TABLE 1 
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Pro 
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Val 


Ala 


Asp 


Gly 


T/U 






Val 
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Gly 


C 
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Glu 


Gly 
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Ala 
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G 
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simple change in a codon for the same amino acid 
residue within a polynucleotide will not change the structure of the 
encoded polypeptides. By way of example, it can be seen from SEQ ID 
NO:l (see FIG, 2) thar>a CCT codon for proline exists at nucleotide 
positionsjg39=S41, It can 
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however, that proline can 1 

nucleotide positions 323=325)* 
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^ be seen from that same sequence, 
J by a CCC codon (see e.g., 
^stitution of the latter CCC codon for 
proline with the CCT codon for proHpe, or vice versa, does not 
substantially alter the DNA sequence dfSEQ ID NO:l and results in 
e3q>ression of the same polypeptide. In a\miilar manner, substitutions of 
codons for other aniino acid residues can be inade in a like manner 
without departing from the true scope of the pre^nt invention. 
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A polynucleotide of the present invention can also be an 
RNA molecule. A RNA molecule contemplated by the present invention 
is complementary to or hybridizes under stringent conditions to any of 
the DNA sequences set forth above. As is wcD known in the art, such a 
RNA molecule is characterized by the base uracil in place of thymidine. 
Exemplary ^djjreferred RNA molecules are mRNA molecules that 

encode an IRZER of this invention. 
A 

The present invention also contemplates oligonucleotides 
from about 15 to about 50 nucleotides in length, which oligonucleotides 
serve as primers and hybridization probes for the screening of DNA 
Kbraries and the identification of DNA or RNA molecules that encode 

Such primers and probes are characterigedin that they will 
\ybridize to polynucleotide sequences encoding ffflBR or related 
receptor proteins. An oligonucleotide probe or primer contains a 
nucleotide sequence of at least 15 nucleotides to or 

complementary to a contiguous sequence of an «VER polynucleotide of 
the present invention. Thus, where an oligonucleotide probe is 25 
nucleotides in length, at least 15 of those nucleotides are identi^or 
complementary to a sequence of contiguous nucleotid^ an^HVER 
polynucleotide of the present invention. Exemplaiy^W^R 
polynucleotides of the present invention are set forth above. 

A preferred oligonucleotide is an antisense oligonucleotide. 
The present invention provides a synthetic antisense oUgonucleotide of 
less than about 50 nucleotides, preferably less than about 35 nucleotides, 
more preferably less than about 25 nucleotides and most preferably less 
than about 20 nucleotides. An antisense oligonucleotide of the present 
invention is directed against a DNA or RNA molecule that encodes 

Preferably, the antisense oUgonucleotide is directed against the 
Votein translational initiation site or the transcriptional start site. In 
accordance with this preferred embodiment, an antisense molecule is 
directed against a region of SEQ. ID NO:l from about nucleotide 
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positio^^to about nucleotide position^333. It is understood by one of 
ordinary skiU in the art that an antisense oligonucleotide can be directed 
either against a DNA or RNA sequence that encodes a specific target 
Thus, an antisense oligonucleotide of the present invention can also be 
directed against polynucleotides that are complementary to those shown 
in SEQ. ID NO:l as well as the equivalent RNA molecules. 

Preferably, the nucleotides of an antisense oligonucleotide 
are linked by pseudophosphate bonds that are resistant to cleavage by 
exonuclease or endonuclease enzymes. Preferably the pseudophosphate 
bonds are phosphorothioate bonds. By replacing a phosphodiester bond 
with one that is resistant to the action of exo-and/or endonuclease, the 
stability of the nucleic acid in the presence of those enzymes is increased. 
As used herein, pseudophosphate bonds include, but are not limited to, 
methylphosphonate, phosphomorpholidate, phosphorothioate, 
phosphorodithioate and phosphoroselenoate bonds. 

An oligonucleotide primer or probe, as well as an antisense 
oligonucleotide of the present invention can be prepared using standard 
procedures well known in the art A preferred method of polynucleotide 
synthesis is via cyanoethyl phosphoramidite chemistry. A detailed 
description of the preparation, isolation and purification of 
polynucleotides encoding humanj^HVER is set forth below. 

Briefly, CHO-Kl cells are resistant to the entry of HSV- 
l(KOS). The present invention discloses an assay to screen for human 
cDNAs encoding proteins capable of conferring susceptibility to HSV- 
l(KOS) infection on the CHO-Kl cells. Control and transfected CHO- 
Kl cells were exposed to a strain of HSV-l(KOS) that had been 
modified to express E, coU beta-galactosidase, under control of a human 
cytomegalovirus promoter, immediately after viral entry into a ceD. Any 
transfected cells that became susceptible to HSV-l(KOS) entry expressed 
beta-galactosidase after infection. Addition of the appropriate beta- 
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galactosidase substrate (X-gal) caused the infected cells to turn blue. 
The high level of resistance of the CHO-Kl cells to HSV-l(KOS) 
infection made it possible to detect very smaD numbers of cells rendered 
susceptible to infection by transfection of the human cDNAs. 

A commercially obtained unidirectional cDNA library 
prepared from human HeLa cell mRNA was used for the transfections. 
The plasmids in this horary express human proteins under control of the 
human cytomegalovirus promoter, after transfection into eukaryotic cells. 
The cDNA Ubraiy was purchased from Invitrogen Corp (3985 B Sorrento 
Valley Blvd., San Diego. CA 92121): 

catalog no. A950-10 

mRNA source HeLa cells (a human ceU line derived from a 
carcinoma) 

primer oUgodT(NotI) 

vector pcDNAI 

This Hbraiy was constructed using materiaU produced by 
Invitrogen according to the following protocol: 

mRNA was isolated from the HeLa cells using the 
Invitrogen FastTrack® mRNA Isolation Kit The mRNA was copied by 
AMV reverse franscriptase. using an oligo dT(NotI) primer, to produce 
the first strand of DNA. The sequence of this primer is 5'- d 
PO,[AACCCGGCrcGAGCGGCCGCT„h3'(SEQIDN0.3). The 

underlined sequence is the NotI site used m a later step for cleavage of 
the cDNA and its insertion into the vector in a directional fashion. 

The product was converted to double-stranded DNA by 
DNA polymerase in combination with RNaseH. and E. coti DNA ligase. 
Any sticky (single-stranded) ends were made btant (filled in) by use of 
T4 polymerase. A BxtXl/EcoRI adapter was added to the ends by bhint- 
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end ligation. The sequence of the adapter is: 

GAATTCCACCACACTTAAGGTG (SEQ ID NO.:4). The cDNA was 
cut with BstXI and Soil and cloned directionally by sticky-end Ugation 
into pcDNAI, which had also been cut with BstXI and Not I. 

The plasmids were used to transform E. coU strain 
MC1061/P3. The number of primary recombinant plasmids was 
estimated to be about 13 x 10*. TTie number of colonies in the amplified 
hTjrary was 4.5 x 10' per ml. TTie estimated size range of the inserts was 
0.9 kb to 1.6 kb. 

1.5 X 10' bacteria were plated (1.5 x 10* bacteria per large 
Petri plate for a total of 100 plates) to allow the growth of bacterial 
colonies. TTie colonies were scraped from each plate to yield one pool of 
bacteria from each plate. Samples of the 100 bacterial pools were mixed 
to yield 10 mixtures of 10 pools each. Plasmid DNA was extracted from 
each mixture of pools by standard means. 

Each plasmid DNA mixture was prepared with 
UpofectAMINE" (GibcoBRL), according to the manufacturer's 
directions, for transf ection into Chinese hamster ovaiy ceUs. strain Kl 
(CHO-Kl). 

To determine whether any of the transfected cells became 
susceptible to HSV-l(KOS) infection, the transfected cells and control 
cells (unfransfected or transfected wifli irrelevant DNA) were exposed to 
a mutant form of HSV-l(KOS) at an input dose sufficientiy high to infect 
all susceptible cells. TTiis mutant is deleted for one of tiie essential 
glycoproteins. gU and must be propagated on gl^expressing Vero cells. 
n,e virus produced on tiie gL-expressing cells is fully infectious but can 
undergo only one round of replication because defective virus is 
produced in non-complementing cells. The gL open reading frame was 
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replaced by the E. coli \^dL gene, downstream of the strong 
cytomegalovirus promoter. The lacZ gene encodes beta-galactos.dase. 

After exposure to virus for several hours, the transfected 
5 CHO-Kl cells were fixed and incubated with the beta-galactosidase 

substrate. X-gal. Susceptible cells were readily identified by their blue 
color resulting from conversion of the substrate to an insoluble blue 
preceipitate by the beta-galactosidase expressed after entry of the mutant 
HSV-l(KOS). 

DNA from one mixture of ten pools was found to be 

positive for ability to convert some of the transfected CHO-Kl cells to 
IIsceptibiHty. Each of the ten bacterial pools in this mixture was ^ted 
separately by extracting plasmid DNA and repeating steps set forth 
15 above. Pool 82 was found to be positive. 

Bacterial pool 82 was itself divided into 100 subpools as 
described above. It was found that subpool 53 was positive. The 
bacteria in subpool 53 were plated and 900 individual clones were picked 
20 and grown up. Plasmids DNAs were extracted from each of the clones 
for testing. Qone 580 was found to be positive. QoneSSOwas 
designated pBEC580. A map of this plasmid is shown in HG. 1. 

n.e nucleotide sequence of the cDNA insert of pBEC580 
^ determined by use of the Sequenase® kit (US Biochemical Corp) 
according to the manufacturer's instructions. 
. \ ^PCgene suite of software from Intelligenetics. Inc. was 

/ used to analyze ^^^de sequence. As shown in FIG. 2, one open 
readingframewasfou^S^H^rrectorientati^ The protem encoded 
in this open reading frame was d^^ a herpesvirus entry receptor 
protein (^) and was found by seqi^^&c^an^ to have propert.es 
6i a type^ membrane glycoprotein. Shown in ^IG^arel) the 
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predicted signal peptide that could direct translocation of the nascent 
peptide across membranes of the rough endoplasmic reticulum; 2) two 
sites that are signals for the addition of N-linked carbohydrate; and 3) a 
hydrophobic region that is predicted to be a membrane-spanning region, 
adjacent to a very basic region which could serve to anchor the protein in 
a niiembrane. 



The blastp and blastn programs were used to search 
databases maintained by the National Center for Biotechnology 
Information (NCBI), National Library of Medicine, National Institutes of 
Health, in Bethesda, MD, for proteins or nucleotide sequences that 
might be identical to, or related to, those of the cDNA insert The 
blas^i program was used to search fofj^fM^R-related protein sequences 
in the database updated daily that contains non-redundant protein 
sequences from five component databases (Brookhaven Protein Data 
Bank, the SWISS-PROT database. The PIR database, the coding 
sequence translations from the GenBank databases and two other 
databases that contain cumulative weekly or daily updates, respectively, 
of the SWISS-PROT database and the translations from Gen Bank). 

T^is search failed to detect any closely related proteins, 
indicating that has not been previously described. The blastp 

program identified about 30 proteins that share a characteristic sequence 
motif with , namely three or more cysteine-rich repeats with a 

characteristic pattern of 6 cysteine residues. These other proteins that 
are related to^^^^ by this motif are all members of the TNF/NGF 
receptor family. They encode membrane receptors that can be triggered 
by the binding of specific ligands to activate specific pathways important 
to cell survival, apoptosis or induced protective responses against 
infectious agents or trauma. 

The blastn program identified two entries in the DNA 
database (the combined non-redundant database consisting of nucleotide 
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sequence entries from the Brookhaven Protein Data Bank, GenBank, the 
EMBL Data Libraiy and cumulative daily updates of the GenBank and 
EMBL databases) that provide partial nucleotide sequence information 
for cDNAs that are very closely related to the cDNA encoding 

A- 

One entry (locus HSCX)BG042) provides partial sequen^^tibat is closely 
related to sequence in the 3' non-coding region of the cDNA. 
The other entry (locus HSO0BG041) provides partial sequence that is 
closely related to sequence in the 5' non-codin^^regi^ and extending 43 
amino acids into the N-terminal region of the^^f^h^ open reading 
frame, but not extending into the TNF/NGF receptor motifs. 



The cDNA insert was transferred to another vector, 
pcDNA3, which carries a selectable marker (the neomycin gene) that can 
be used to isolate cell lines stably canying the plasmid. Cells that cany 
and express this gene are resistant to the toxic effects of a drug called 
G418. The cDNA insert of pBEC580 was excised by cutting with Hindlll 
and Xhol and the insert was ligated to pcDNA3, which had also been cut 
with Hindm and Xhol, to produce the new plasmid called pBEClO. A 
map of pBEClO is shown in FIG. 3. 

CHO-Kl cells were transfected with pBEClO or pcDNA3 
and, after about 48 hours, incubated with medium containing G418. 
Only cells carrying the plasmid (with the Neo marker) were able to 
survive, 

Se^ei^ stably transformed colonies of cells were isolated 
after transfection with^ach plasmid and were cloned. None of the 
clones obtained with pcDNA^^vere susceptible to HSV-l(KOS) 
infection. About half of the clones OThK^d with pBEClO were 
susceptible (the resistant clones may not havM^en able to express the 
protein encoded in the cDNA insert). Cells plate^HP 96-welI dishes, at 
densities ranging from 10* to 5 x 10* cells per well, werc"^€roosed to 
HSV-l(KOS)gL86 in the quantities indicated. At 6 hr afterW^dition 



if^^ims^^heceUs were solubilized with detergent and beta-galactosidase 
substrate adSixlto assess the efficiency of viral entry. Bcpression of 
beta-galactosidase^ig^ that the virus has entered a ceU and the 
amount of enzyme proAlfce^fe proportional to the number of infected 
cells, at least until plateau val^^beta-galactosidase activity are 
achieved. FTGs. 4 and 5 show that eHP-Kl parental cells and CHO 
cells transfected with the control plasmi^ >cpNA3. are resistantt«^^ 
l(KOS) infection whereas the cells transfected with, and stably c^^ying 
pBEClO. are susceptible to HSV-l(KOS) infection. 

Although the cells transfected with the^H^R cDNA are 
fully susceptible to infection by HSV-l(KOS). tfiey are resistant to 
infection by a mutant of HSV-l(KOS), designated HSV-l(KOS)ridl. tiiat 
dffiers from parental virus only by an amino add substitution in tiie viral 
envelope glycoprotein gD. This indic^ that gD, at least in part, 
determines the ability ofWrus to use^H^ for entry. Because HSV- 
l(KOS) expressing the mutant form of gD can infect human cells almost 
as efficiently as parental HSV-l(KOS). there must be cell surface 
molecules expressed in human ceUs, in addition to^^. that can be 
used for entry. 

CHO-A12 cells in 6-well plates were ti^nsfected with 
plasmids that eqjress HSV-1 gD (pRE4) or HSV.2 gD (pWW65) under 
conti^ol of tiie Rous sarcoma vims promoter or witii a control plasmid 
consisting of the vector with no insert (pdH). These plasmids were 
obtained from G. Cohen and R. Eisenberg (Univ. of Pennsylvania). 
Transfection was done with the UpofectAMINE'- reagent (GibcoBRL) 
using plasmid quantities ranging from 0.5 to 2.5 per well. At 24 hr 
after to^nsfection, the cells were replated in 96-weU plates and, 12 hr 
later, were exposed to HSV-l(KOS)gL86 to assess the susceptibility of 
the cells to infection. 



'/ 
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Transaction o^tR^R-expressing CHO cells with a 
plasmid that expresses^qW-type gD (either the HSV-1 or HSV-2 fonns 
of gD) confers resistance tOH^fection by HSV-l(KOS) (See FIG. 6). 
This is an interference activity that has been previously described. 

When gD is expressed by the cell, it^^n render a susceptible ceU 
resistant to HSV-1 infection, possibly b^v^questering a cell surface 
receptor needed for HSV-1 entry. The facis^at gD expression renders 
the^ffVl^-expressing ceUs resistant to HSV-l(KpS) infection suggests 
that there may be a direct physical interaction betv^e^n gD (both the 

HSV-1 and HSV-2 fonns) and W^. 

A. - 
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Table 2 below lists the cell lines c^tained and summarizes 
some of their properties: 

Table 2. 



Cell line 


Plasmid 
used for 
Iransfection 


Susceptible to infection by: 


HSV-l(KOS) 


HSV. 
l(KOS)ridl 


HSV-t{F) 


HSV-2(333) 


Cell lines obtained from others or from culture collections: 


Hel^ (human) 
Hep-2 (human) 
CHO-Kl (hamster) 


None 
None 
None 


Yes 
Yes 
No 


Yes 
Yes 
No 


Yes 
Yes 

PartiaUy 


Yes 
Yes 
Yes 


New cell lines 


CHO.A3 

CHO-A12'' 

CHO-B3 

CHO-B9 

CHO-Bll 

CHO-C8 


pBEClO 
pBEClO 
pBEClO 
pBEClO 
pBEClO 
pcDNA3 


Yes 
Yes 

partially 
Yes 
Yes 
No 


N.T/ 

No*^ 

N.T/ 

N.T.* 

N.T.' 

No*^ 


N.T,* 

Yes 

N.T.' 

N.T.- 

N.T.' 

N.T.' 


N.T/ 

Yes 

N.T.' 

N.T/ 

N.T/ 

N.T.' 
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• N.T.-Not tested. 

** When the A12 cells were transfected with plasmids e^ressing HSV-1 

or HSV-2 gD, they became resistant to HSV-l(KOS) infection. 

*CHO-Kl cells are slightly more susceptible to infection by HSV- . 

l(KOS)ridl than by parental HSV-l(KOS) but the expression of 

A 

in the transfected cells does not enhance susceptibility of the cells to 
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HSV-l(KOS)ridl, in marked contrast to the results obtained with HSV- 
l(KOS). 



Southern blots were done with digests of DNA from three 
human cell lines (Hep-2, HeLa and HT1080), one monkey ceU ^ 
(Vero), the Chinese hamster overy ceU line used for clonin^HVER 
(CHO-Kl) and two of the CHO ceU Unes stably tranfected with pBEClO 
rCHO-A12 and CHO-B9). The probes used to detect DNA fragments 
with homology to IFWHl were an EcoRI fragment of the^hVER cDNA 
insert that inchides most of them^ and a smaUer PvuH fragment that 
inchides only the 3' end of the^»«R open reading frame and some of 
the on-coding sequence downstream. The results shovi^dl^at: (i) all 
three human ceU lines contain DNA homologous to^H^R with 
fragment sizes that are the same for aU three cell lines in a single digest 
(different digests yield hybridizable bands of different sizes but the DNAs 
from three ceU lines are indistinguishable); (ii) only a subset of the 
human DNA fragments that hybridize to the larger EcoRI fragment also 
hyubridize to the smaUer PvuH fragment; (ui) the monkey cells contain 
weakly hybridizable DNA fragments of different sizes from those found 
in the human DNAs; (ic) the parental CHO-Kl ceUs contain no 
hybridizable DNA fragments; (v) the stably t^a^sfectedcell lines (CHO- 
A12 and CHO-B9) contain DNA homologous io^^s^SR as predicted. 

The results obtained with the human, monkey and Chinese 
hamster DNAs confirm thatj!^^ is encoded by a human cDNA and 
indicate that the human J^J^ gene is probably a single-copy gene with 
multiple introns and exons, perhaps extending over a large stretch of 
DNA The results also indicate that monkey ceUs have a gene related to 
human^fVER. K Chinese hamster cells have an^HVER g^°^^>Jg^^ 
sequence has diverged too much to be detected by a human^lfi«R 
probe. 



Poly-adenylated RNAs extracted from various human 
tissues (heart, brain, placenta, lung, liver, skeletal muscle, kidney, and 
pancreas) were obtained from Clontech, Inc., as samples that had already 
been fractionated by electrophoresis and transferred to a membrane. 
5 The membrane was used for hybridization withjme larger ,EroR» probe 
mentioned above (almost the entire HVER cDNA insert). The results . 
showed that there were variable amounts of RNA homologous to ftVER 
in all the samples. The largest amounts were found in lung and kidney. 
The sizes of the bands were about 1.8 and 3.8 kb. The iScER cDNA 
10 insert claimed in the application is about 1.8 kb. 

in. HVER Pofypeptides \4t;^jti 

In another aspect, the pre^ntmvention provides asiWVSR 
polypeptide of mammalian origin. An.H\^R of the present invention is 
15 a polypeptide of about 300 amino add residues. Preferably, airBWBR is 

a human*K£R. A human form of HVBR is shown in SEQ ID NO:2. 
Thus, human ^^IVER can be defined as a polypeptide of about^30O- or less 
amino acid residues comprising the amino acid residue sequence of SEQ 
ID NO:2. 

20 

The present invention also contemplates amino acid residue 
sequences that are substantially duplicative of the sequences set forth 
herein such that those sequences demonstrate like biological activity to 
disclosed sequences. Such contemplated sequences include those 
25 sequences characterized by a minimal change in amino acid residue 

sequence or type (e.g., conservatively substituted sequences) which 
insubstantial change does not alter the basic nature and biological activity 



30 It is well known in the art that modifications and changes 

can be made in the structure of a polypeptide without substantially 
altering the biological function of that peptide. For example, certain 
amino acids can be substituted for other amino acids in a given 
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polypeptide without any appreciable loss of function. In making such 
changes, substitutions of like amino acid residues can be made on the 
basis of relative similarity of side-chain substituents, for example, their 
size, charge, hydrophobidty, hydrophilicity, and the like. 

5 

As detailed in United States Patent No. 4,554,101, 
incorporated herein by reference, the following hydrophilicity values have 
been assigned to amino acid residues: Arg (+3.0); Lys (+3.0); Asp 
(+3.0); Glu (+3.0); Ser (+0.3); Asn (+0.2); Gin (+0.2); Gly (0); Pro (- 
10 0.5); Thr (-0.4); Ala (-0.5); His (-0.5); Cys (-1.0); Met (-13); Val (-1.5); 
Leu (-1.8); De (-1.8); Tyr (-2.3); The (-2.5); and Tip (-3.4). It is 
understood that an amino acid residue can be substituted for another 
having a similar hydrophilicity value (e.g., within a value of plus or minus 
2.0) and still obtain a biologically equivalent polypeptide. 

15 

In a simflar manner, substitutions can be made on the basis 
of similarity in hydropathic index. Each amino acid residue has been 
assigned a hydropathic index on the basis of its hydrophobicity and 
charge characteristics. Those hydropathic index values are: De (+4.5); 
20 Val (+4.2); Leu (+3.8); Phe (+2.8); Cys (+2.5); Met (+1.9); Ala (+1.8); 
Gly (-0.4); Thr (-0.7); Ser (-0.8); Tip (-0.9); Tyr (-1.3); Pro (-1.6); His (- 
3.2); Ghi (-3.5); Ghi (-3.5); Asp (-3.5); Asn (-3.5); Lys (-3.9); and Arg (- 
4.5). In making a substitution based on the hydropathic index, a value of 
within plus or minus 2.0 is preferred. 

25 

A comparison of the amino acid sequence SEQ ID NO:2 
with the protein databases maintained at the National Library of 
Medicine (NIH) and with a computer program designed to detect 
functional motife in proteins revealed that^HV^BR has not previously 
30 been described, that it is not closely related to other proteins in the 
database, but that it has three copies of a cysteine-rich modf found in 
members of the TNFR/NGFR family. 
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heavy chain fragment. This latter fragment was prepared for the ligation 
by using PCR technology to insert an EcoRI site just upstream of the 
foUowing rabbit sequence (ACAAGACCGTTGCACCCTC) (SEQ ID 
NO:5), Cleavage at this EcoRI site, followed by filling-in, permitted 
5 bhint-end ligation to the PvuII site otftVER so that the two open 

reading frames were joined in the same reading frame. The 3* end of the 
rabbit cDNA insert was cut with PstI and joined to the Hindin site of 
pGEM4 by blunt-end ligation (the PstI cut end was trimmed and the 
Hindn cut end was filled in). The vector sequences are from pGEM4 
10 and include sequences extending to a unique Nhel site that was joined by 

sticly-end ligation to an Spel site adjacent to the cytomegalovirus 
promoter (P-CMV) from pcDNAneo. The other end of the P-CMV 
region was cut with Hindlll and is joined to the Hindlll site at the top of 
the mapj^^^ 

15 

Expression of the hybrid protein has been demonstrated 
both in Vero cells and in CHO-Kl cells. The protein is secreted into the 
medium, as predicted, since it should have a signal sequence for 
translocation into the cell's secretory pathway but has no membrane- 
20 spanning region to anchor it to a membrane. The hybrid protein is 

readily detected on Western blots by use of commercially available 
antibodies specific for the constant regions of rabbit IgG. 

Treatment of the hybrid protein with various glycosidases 
25 (endoH, endoF and endoO) has revealed that the protein carries N- 

linked gfycans of the complex type, which is characteristic for secreted 
proteins, and also carries O-linked glycans, as predicted. This hybrid 
protein is used to screen for mouse monoclonal antibodies specific for 
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-HVER and to identify HSV proteins with which it might mteract 



\ 'llie-hj^rid gene is contained in two different expression 

^/ plasmids, the latter ofwSi51r«Qiitains a selectable marker for obtaining 
transformed cells that stably carry ^e^pfciOTiid. Transfection of these 
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inciting pGEM3, pGEM4 and pcDNAneo. Startmg from the Hindm 
„ site ofNoBLSS, part of the polylinker from pGEM3 (Hindm site to Xbal 
site) wa^Unked to a sticky end created by cutting the^f^W^ insert with 
Nhel aboik 37 nudges upstream of the start codon. Another 

cleavage of W HVER insert at a PvuH site within the open reading 
frame createdV blunt end that was blunt-end ligated to the rabbit IgG 
heavy chain frag^nt This latter fragment was prepared for the ligation 
by using PGR techi^^ogy to insert an EcoRI site just upstream of the 
foUowing rabbit sequ^^ce (ACAAGACCGTrGCACCCTC) (SEQ ID 
NO:5). Cleavage at this^RI site, foOowed by fiHing-in, permitted 
bhint-end^ligation to the iVbH site of^^f«^^so that the two open 
reading fr>ames were joined inW same reading frame. The 3' end of the 
rabbit cDNA insert was cut with and joined to the HindHI site of 
pGEM4 by blunt-end Ugation (the cut end was trimmed and the 
Hindn cut end was fiUed in). The vec^^ sequences are from pGEM4 
and include sequences extending to a uniqii^ Nhel site that was joined by 
sticky-end ligation to an Spel site adjacent to^e cytomegalovirus 
promoter (P-CMV) from pcDNAneo. The other end of the P-CMV 
region was -t^Jh^^^^^ the top of 

the map (See-^Q lD NO ,6, KQsrSTTandiB). ' \ 



^Expression of the hybrid protein (SEQ. ID. NO:7, FIGs. 
^^ArB&Ri^^Si^^has been demonstrated both in Vero cells and in CHO-Kl 
cells. The proteiiN^^ecreted into the medium, as predicted, since it 
should have a signal sequence for translocation into the ceD's secretory 
pathway but has no membran^^anning region to anchor it to a 
membrane. The hybrid protein is r^ja^y detected on Western blots by 
use of commercially avaflable antibodies^p^ific for the constant regions 
of rabbit IgG. 

Treatment of the hybrid protein with various glycosidases 
(endoH, endoF and endoO) has revealed that the protein carries N- 
linked glycans of the complex ^e, which is characteristic for secreted 
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proteins, and also carries O-linked glycans, as predicted. This hybrid 
'twS^ ^ *° screen for mouse monoclonal antibodies specific for 
^HVER and to identify HSV proteins with which it might interact 



The hybrid gene is contained in two different expression 
plasmids, the latter of which contains a selectable marker for obtaining 
transformed cells that stably cany the plasmid. Transfection of these 
plasmids into cells has revealed expression of a hybrid polypeptide of 
molecular weight approximately 60»000 after dissociation into its 
component chains. 

This hybrid polypeptide, designated^^^l^KVFc, carries N- 
linked glycans and is expressed as a dimer held together by disulfide 
bonds (this is characteristic of hybrid proteins prepared with IgG 
domains that can dimerize to form the Fc region). Commercial^ 
available antibodies specific for rabbit IgG were used to detect^HVHVFc 
in Western blots and in ELISA assays. 

The observed apparent size of the hybrid protein is similar 
to the size predicted, provided the predicted molecular weight includes 
about 10,000 for the added carbohydrate. 

Evidence has been obtained that^p^^BR is only one of 
several cell surface receptors that can mediate the entiy of HSV-1 and 
HSV-2 into cells and that functional use of^HVER (and perhaps other 
receptors) is determined by the structure of the virion envelope 
glycoprotein gD. A mutant of HSV-l(KOS), designated HSV-l(KOS)ridl 
has a single amino acid substitution in gD that confers resistance to gD- 
mediated interference with HSV infection and alters slightly the ability of 
this virus, relative to the wild-type parental strain, to penetrate into 
various cell types including CHO cells and human cells. By use of a 
mutant strain of HSV.l(KOS) that is deleted for gD and complemented 
by replication in cells expressing either the wild-type or mutant form of 
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gD it has been shown that expression renders CHO cells fully 

susceptible to infection by^virus carrying ^d-type gD but not to infection 
by virus carrying mutant gD, despite the fact that both viruses could 
infect human cells with nearly equal efficiency. The implications of this 
result are several-fold. 

First, the result shows that the structure of gD determines 
whether M&ta»i be used as a receptor for entry, raising the possibility 
of a direct physical interaction. This is consistent with knowledge that 
gD is one of at least four envelope glycoproteins required for HSV entry. 
Second, althoug^ff^ is expressed in cultured human cells, such as 
HeLa cells (the cDNA Kbrary used was prepared from HeLa cells), there 
must be other receptors expressed in human cells that can facilitate the 
entry of HSV-l(KOS) carrying the mutant form of gD. Third, because 
CHO-Kl cells are so resistant to HSV-l(KOS) carrying the ridl form of 
gD, it is possible to use the gD-negative mutant of HSV-l(KOS), which 
expresses beto-galactosidase and can be complemented with the ridl 
form of gD, to screen for expression of the human gene or genes that 
can facilitate the entry of HSV-l(KOS)ridl into CHO-Kl cells. 

The possibility exists that several members of the 
TNFR/NGFR family can serve as receptors for entry of HSV-1, HSV-2 
or other herpesviruses, and that the particular receptor favored by a 
given herpesvirus or strain is determined at least in part by the structure 
of gD. 

Expression of^HVER in CHO-Kl cells significantly 
enhances the entry of at least two HSV-1 strains. Because the original 
CHO cells are folly susceptible to entryofAe HSV-2 strains tested, it is 
not possible to assess directly whetherjp^ has any effect on HSV-2 
entry into CHO-Kl ceDs. Cells expressing HSV gD become resistant, 
however, to HSV-1 and HSV-2 infection, and also to infection with 
related alphaherpesviruses, because of a block to penetration (binding is 
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unimpaired by gD expression). This phenomenon has been caUed gD- 
mediated interference 

The fact that HSV-1 gD can interfere with infection by 
HSV-1. HSV-2 or other herpesviruses implies that aU the herpesviruses 
may use an overlapping set of receptwrefor entiy. Transient expression of 
gD in CHO cells already expressin^lw^ renders the ceUs resistant to 
HSV-l(KOS) entry. Both the HSV-2 and HSV-1 forms ^^*yP^ eP 
are able to interfere with ability of HSV-1(K^^ use/f^R for entry, 
suggesting that both forms can interact witlj^IfVER for interference and 
possibly also for entiy. As predicted from the hypothesis about the 
mechanism of interference, the ridl form of gD is impaired m ability to 
mediate interference ii^ff^-expressing CMO cells (consistent with the 
finding that vims carrying the ridl fonn of gD is impaired in ability to 



enter HWraR.-e3q)ressing CHO cells). 
A 

The interference activity of quantitated by 

transfecting gD-expressigilasmids into HVER-expressing CHO cells 
and then exposing the cdls to HSV-l(KOS)gL86 to detennine whether 
the cells are susceptible or resistant to viral entry. This provides an assay 
for testing the mterference activity of various fD mutants, in order to 
define the structural features of fD that are required for interference. A 
number of mutant forms of gD produced by others (G. Cohen and R. 
Eisenberg of the Univ. of Pennsylvania) have aheady been tested by 
others to determine whether the mutations alter gD function in the virion 
(function required for viral entry into cells). These same mutant forms 
of gD (provided by G. Cohen and R. Eisenberg) are being tested for 
their interference actibity. Results obtained to dateare summarized in 
FIG. 10. CHO-A12 cells, which stably express^HVER, were plated on 6- 
wcll dishes and transfected with one of several plasmids that express 
different forms of gD. At 24 hr after transfection. the cells were replated 
in 96-well plates and 12 hr later they were exposed to HSV-l(KOS)gL86 
at several concentrations. At 6 hr after adding virus, the cells were 



solubilized and /3-galactosidase substrate was added* The colored 
product was quantitated by spectromophotometiy. Selecting the values 
obtained (OD410) at a dose of virus where the amount of virus adde^^^ 
^ was directly proportional to the amount of ^-galactosidase detected, 

5 data were normalized for comparison by dividing the )^values obtained 

for cells transfected with a gD-expressing plasmid by the value obtained 
for cells transfected with a control plasmid (control). The forms of gD 
e;q>ressed by the various plasmids were wild-type gD-1 (pRE4), which is 
369 amino adds in length, wild-type gD-2 (pWW65), mutant gD-1 
10 deleted for amino acids 196-207 (pWW13), mutant gl>-l deleted for 
amino acids 234-244 (pWW17), mutant gl>-l deleted for amino adds 
194-287 (pWW49), mutant gD-1 deleted for amino acids 234-287 
(pWW52), mutant gD-1 deleted for amino acids 208-287 (p\VW61), 
mutant gD-1 with a substitution that replaces Glu with Asp at position 63 
15 (pWW62), mutant gl>-l deleted for amino acids 338-369 (pWW63) and 

mutant gD-1 with a substitution that replaces Gin with Pro at position 27 
(pMW13). Low (-galactosidase activity implies that the transfected gD 
had interference activity; high activity indicates that the transfected gD 
had reduced or no activity. All plasmids used except pMW13 were 
20 obtained from G. Cohen and R. Eisenberg (Univ. of Pennsylvania). The 
results indicate that deletions or alterations of gD between the middle 
and membrane-spanning region of the molecule eliminate interference 
activity whereas deletion of the cytoplasmic tail of gD and an amino acid 
substitution at position 63 are without effect An amino acid substitution 
25 at position 27 (the ridl mutation) reduces, but does not eliminate, 

interference activity. From the results obtained to date, it appears that 
alterations affecting the function of gD in inf ectivily also affect its 
function in interference. This is consistent with the hypothesis that gD 
interference results from competition between cell-ass«aated gD and 
virion-assodated gD for a common target, possibl^lIVER. 

An^^iVBR polypeptide of the present invention has 
numerous uses. By way of example, such a polypeptide can be used in a 
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screening assay for the identification of drugs or compounds that inhibit 
or augment the action o^&^ (e.g., agonist and antagonist to HSV 
entry into a ceU). A screening assay for the identification of such 
compound, therefore, can be established whereby the abili^ of a 
compound to alter the action of^I^^ can be determined by exposing 
cells to HSV in the presence of a polypeptide of the present invention 
and varying amounts of compounds suspected of inhibiting the activity of 

A 

... ^ . . 
The hybrid protein^HVmVFc is bemg used to unmim^^ 

rabbits for the production of polyclonal antisera specific for the^HVER 

portion of the molecule. In addition the hybrid protein is used to screen 

for hybridomas secreting antibodies specific for th^lEWBR portion (the 

mice were immunized with^^^^R-expressing CHO cells). The hybrid 

protein is used to determine whether a physical interaction between the 

hybrid protein and gD or other viral proteins can be detected. The 

hybrid protein also has use in screening expression cDNA libraries for 

natural ligands ofj^^k and screeninec^pounds for inhibitors of the 

interaction between HSV virions and flFVER. 



A 



In addition, an^-^^polypeptide of the present invention 

A- 

can be used to produce antibodies that immunoreact specifically with 
HVER Means for producing antibodies are weU known in the art An 
\ntibody directed against HVER can be a polyclonal or a monoclonal 
antibody. 

Antibodies agains^HVER can be prepared by unmunizmg 
an animal with an HVER polypeptide of the present invention. Means 
for nnmunizing animals for the production of antibodies are well known 
in the art By way of an example, a mammal can be injected with an 
inoculum that includes a polypeptide as described herein above. The 
polypeptide can be included in an inoculum alone or conjugated to a 
carrier protein such as keyhole limpet hemocyanin (KLH). The 



polypeptide can be suspended, as is well known in the art, in an adjuvant 
to enhance the immunogenicity of the polypeptide. Sera containing 
immunologically active antibodies are then produced from the blood of 
such immunized animals using standard procedures well known in the 
art 



The identification of antibodies that immunoreact 
specifically with^ffiiSR is made by e^^osing sera suspected of containing 
such antibodies to a polypeptide of' the present invention to form a 
conjugate between antibodies and the polypeptide. The existence of the 
conjugate is then determined using standard procedures well known in 
the art 



An^HVER polypeptide of the presentinvention can also be 

used to prepare monoclonal antibodies against H^^R and used as a 

A 

screening assay to identify such monoclonal antibodies. Monoclonal 
antibodies are produced from hybridomas prepared in accordance with 
standard techniques such as that described by Kohler et al. (Nature, 
256:495, 1975). Briefty, a suitable mammal (e.g., BALB/c mouse) is 
immunized by injection with a polypeptide of the present invention. 
After a predetermined period of time, splenocytes are removed from the 
mouse and suspended in a cell culture medium. The splenocytes are 
then fused with an immortal cell line to form a hybridoma. The formed 
hybridomas are grown in cell culture and screened for their abilily to 
produce a monoclonal antibody against^^iV^. Screening of the cell 
culture medium is made with a polypeptide of the present invention. 

IV. Method of Makin^H\^ 

^ In another aspect, the present invention provides a process 
of makin^HVER. In accordance with that process, a suitable host cell is 
transformed with a pofynucleotide of the present invention. The 
transformed cell is maintained for a period of time sufficient for 
expression of th^ffVER. The forme^^^IVBR is then recovered. 

29 



Means for transformuig host cells in a manner such that 
those cells produce recombinant polypeptides are well known in the art. 
Briefly, a polynucleotide that encodes the desired polypeptide is placed 
into an expression vector suitable for a given host cell. That vector can 
be a viral vector, phage or plasmid. In a preferred embodiment, a host 
cell used to produc^l¥^BR is an eukaryotic host cell and an expression 
vector is an eukaryotic expression vector (i.e., a vector capable of 
directing egression in a eukaryotic cell). Such eukaryotic e3q>ression 
vectors are well known in the art 

In another embodiment, the host cell is a bacterial cell. An 
especially preferred bacterial cell is an E. coU. Thus, a preferred 
e]q>ression vector is a vector capable of directing expression in E. coli, 

A polynucleotide of an expression vector of the present 
invention is preferably operatively associated or linked with an enhancer- 
promoter. A promoter is a region of a DNA molecule typically within 
about 100 nucleotide pairs in front of (upstream of) the point at which 
transcription begins. That region typically contains several types of DNA 
sequence elements that are located in simOar relative positions in 
di^erent genes. As used herein, the term "promoter" includes what is 
referred to in the art as an upstream promoter region or a promoter of a 
generalized RNA polymerase transcription unit 

Another type of transcription regulatory sequence element 
is an enhancer. An enhancer provides specificity of time, location and 
expression level for a particular encoding region (e.g., gene). A major 
function of an enhancer is to increase the level of transcription of a 
coding sequence in a cell that contains one or more transcription factors 
that bind to that enhancer. Unlike a promoter, an enhancer can function 
when located at variable distances from a transcription start site so long 
as the promoter is present 



As used herein, the phrase "enhancer-promoter" means a 
composite unit that contains both enhancer and promoter elements. An 
enhancer promoter is operatively linked to a coding sequence that 
encodes at least one gene product As used herein, the phrase 
"operatively linked" or its grammatical equivalent means that a regulatory 
sequence element (e.g. an enhancer-promoter or transcription 
terminating region) is connected to a coding sequence in such a way that 
ithe transcription of that coding sequence is controlled and regulated by 
that enhancer-promoter. Means for operatively linkmg an enhancer- 
promoter to a coding sequence are well known in the art 

An enhancer-promoter used in an expression vector of the 
present invention can be any enhancer-promoter that drives expression in 
a host cell. By employing an enhancer-promoter with well known 
properties, the level of expression can be optimized. For example, 
selection of an enhancer-promoter that is active in specifically 
transformed cells permits tissue or cell specific expression of the desired 
product Still further, selection of an enhancer-promoter that is 
regulated in response to a specific physiological signal can permit 
inducible expression. 

A coding sequence of an expression vector is operatively 
linked to a transcription terminating region. RNA polymerase 
transcribes an encoding DNA sequence through a site where 
polyadenylation occurs. Typically, DNA sequences located a few 
hundred base pairs downstream of the pofyadenylation site serve to 
terminate transcription. Those DNA sequences are referred to herein as 
transcription-termination regions. TTiose regions are required for 
efficient polyadenylation of transcribed messenger RNA (mRNA). 
Enhancer-promoters and transcription-terminating regions are well 
known in the art The selection of a particular enhancer-promoter or 
transcription-terminating region will depend, as is also weU known in the 
art, on the ceU to be transformed. 




A clone of the human form of^HVBR was identified by 
DNA sequence analysis as set forth above. This clone was used in all 
subsequent expression studies.^ ffVER was expressed in CHO-Kl cells 
under the control of a human cytomegalovirus promoter. 



Expression vectors containing the encoding DNA sequence 
for all or a portion of humas^W^R are designated pBEC580, pBEClO, 
and pBL58. Both vectors were deposited, under the terms of the 
Budapest Treaty, on July 28, 1995 in the American Type Culture 
Collection, 12301 ParWawn Drive, Rockville, MD 20852, and have been 
assigned ATCC Accession Nos: ^^f2^BEC580), 97f^BEC10), and 

The present invention also contemplates a host cell 
transformed with a polynucleotide or expression vector of this invention. 
Means for transforming cells and polynucleotides and expression vectors 
used to transform host cells are set forth above. Preferably, the host cell 
is an eukaiyotic host cell such as a mammalian cell or a prokaryotic cell 
such as an E, coli, 

V. Pharmaceutical Compositions 

The present invention also provides a pharmaceutical 
composition comprising a polypeptide or a polynucleotide of this 
invention and a physiologically acceptable diluent 

In a preferred embodiment, the present invention includes 
one or more antisense oligonucleotides or polypeptides, as set forth 
above, formulated into compositions together with one or more non-toxic 
physiologically tolerable or acceptable dfluents, carriers, adjuvants or 
vehicles that are coUectively referred to herein as diluents, for parenteral 
injection, for oral administration in solid or liquid form, for rectal or 
topical administration, or the like. 




The compositions can be administered to humans and 
animals either orally, rectafly, parenterally, intradstemally, intravaginally, 
intraperitoneally, locally, or as a buccal or nasal spray. 



Compositions suitable for parenteral administration can 
comprise physiologically acceptable sterile aqueous or non-aqueouis 
solutions, dispersions, su^ensions or emulsions and sterile powders for 
reconstitution into such sterile solutions or dispersions. Examples of 
suitable dQuents inchide water, ethanol, polyols, suitable mixtures 
thereof, vegetable oils and injectable organic esters such as ethyl oleate. 
Pr<qier fluidity can be maintained, for example, by the use of a coating 
such as lecithin, by the maintenance of the required particle size in the 
case of dispersions and by the use of surfactants. 

Compositions can ako contain adjuvants such as preserving, 
wetting, emulsifying, and dispensing agents. Prevention of the action of 
microorganisms can be insured by various antibacterial and antifungal 
agents, for example, parabens, chlorobutanol, phenol, soibic add, and the 
like. It may also be desirable to include isotonic agents, for example, 
sugars, sodium chloride and the like. Prolonged absorption of the 
injectable pharmaceutical form can be brought about by the use of agents 
delaying absorption, for example, aluminum monostearate and gelatin. 

Besides such inert diluents, the composition can also 
include sweetening, flavoring and perfuming agents. Suspensions, in 
addition to the active compounds, may contam suspending agents, as for 
example, ethoxylated isostearyl alcohols, polyoxyethylene soAitol and 
sorbitan esters, microcrystalline celhilose. aluminum metahydroxide, 
bentonit, agar-agar and tragacanth. or mixtures of these substances, and 
the like. 

The invention has been described in terms of preferred 
embodiments. One of ordinary skill in the art readily appredates that 
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changes and modifications can be made to those embodiments without 
departing from the true scope of this invention. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) 



►PLICANT: SPEAR, Patricia G. 

MONTGOMERY, Rebecca I. 



(ii) TI-n\OF INVENTION: HERPES VIRUS ENTRY RECEPTOR PROTEIN 
(iii) NUMBERNoE SEQUENCES: 7 

(IV) ^??^SIseI:'^R^^^ goldsmith, SHORE MILNAMOW 
STREit: 180 M. STET&)M, SUITE 4700 
CITY:\HICAG0 
STATE: NILLINOIS 

countryKu.s.a 




(V) COMPUTER READA^ FORM: 

^ ' (A) MEDIUM TYP§t Floppy dis^Ki^ 

COMPUTER: IBM PC compatible 

igj KiS? r!l|in'g;»n. V,..!.. ...30 

(vi) CURRENT APPLICATION WtA: 
IVi) APPLICATION NUuSkR: 

(B) FILING DATE: ^ 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATIC 
^ ' (A) NAME: NORTHRUP, THOMAY^E. 

B REGISTRATION NUMER: ||,268 
(C) REFERENCE/DOCKET NUMBER^ XX 



(ix) TELECOMMUNICATION INFORMATION: 
y^^^f '^^v TELEPHONE: (312) 616-5400 
TELEFAX: (312) 616-5460 
TELEX: XX 

(2) INFORMATION FOR SEQ ID N0:1: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1719 base pairs 
I B) TYPE: nucleic acid 
C) STRANDEDNESS: single 
I D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

^ ' (A) NAME/KEY: matjpeptide 
|b{ LOCATION: 293,. 1189 

(ix) FEATURE: 

^ ' (A) NAME/KEY: CDS 

(B) LOCATION: 293.-1192 

(ix) FEATURE: ^ 
^ ' (A) NAME/KEY: sigjeptide 
(B) LOCATION: 293. .406 



(xi) SEQUENQE ©ESpRIPTION; SEO IP N0:1: 
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CCTTCATACC GGCCCTTCCC CTCGGCTTTG .CCTGGACAGC 
CCTGTGTCCC CCAGCGCCGC TCCACCCAGC AGGGCTGAGG 
CCCCTGCTGC CCACtCTCCT GCT6CTCGGG TTCTGAG6CA 
GATTCTCTTT CTCfTTCTCT TCTGGCCCAC AGCCGCAGCA 
TGGAGTTCAT GCTGCfAGCT GGGTTCCC6A GCTGCCGGTC 



GAG CCT CCT GGA GAC ^GG GGG CCT CCT CCC TGG 
Glu Pro Pro Gly Asp Trp Gly Pro Pro Pro Trp 
5 \ 10 

ACC GAC GTC TTG AGG CTG\GTG CTG TAT CTC ACC 
Thr Asp Val Leu Arg Leu Val Leu Tyr Leu Thr 
20 \ 25 

TGC TAG GCC CCA GOT CTG Ccb TCC TGC AAG GAG 
Cys Tyr Ala Pro Ala Leu ProXSer Cys Lys Glu 
35 40 ^ 

GGC TCC GAG TGC TGC CCC ACG TGC AGT CCA GGT 
Gly Ser Glu Cys Cys Pro Thr Cy\ Ser Pro Gly 
50 55 \ 60 

GCC TGC GGG GAG CTG ACG GGC ACA (JtO TGT GAA 
Ala Cys Gly Glu Leu Thr Gly Thr Vskl Cys Glu 
ro \ 75 

ACC TAG ATT GCC CAC CTC AAT GGC CTA \GC AAG 
/ Thr Tyr He Ala His Leu Asn Gly Leu S^r Lys 
85 90 

ATG TGT GAC CCA GCC ATG GGC CTG CGC GCGNaCG 
Met Cys Asp Pro Ala Met Gly Leu Arg Ala Thr 
100 105 

ACA GAG AAC GCC GTG TGT GGC TGC AGC CCA GGft 
Thr Glu Asn Ala Val Cys Gly Cys Ser Pro Gly' 
115 120 



TCTGCCTCCC 6CAGGGCCCA 

CCCTCTCTGC TGCCAGACAC 

CAGCTTGTCA CACCGAGGCG 

ATGGCGCTGA GTTCCTCTGC 

TGA6CCTGAG GC ATG 
Met 
1 

AGA TCC ACC CCC AGA 
Arg Ser Thr Pro Arg 
15 

TTC CTG GGA GCC CCC 
Phe Leu Gly Ala Pro 
30 

GAC GAG TAG CCA GTG 
Asp Glu Tyr Pro Val 



TAT CGT GTG AAG GAG 
Tyr Arg Val Lys Glu 
65 

CCC TGC CCT CCA GGC 
Pro Cys Pro Pro Gly 
80 

TGT CTG CAG TGC CAA 
Cys Leu Gin Cys Gin 
95 

GGG AAC TGC TCC AGG 
Arg Asn Cys Ser Arg 
110 

CAC TTC TGC ATC GTC 
^His Phe Cys He Val 

125 

CCA CCT CCA GCC 
Pro Pro Pro Ala 
145 

GTC AGG ACA CCC TGT 
Val A^g Thr Pro Cys 
160 

GGG ACC "CT^ ^ 
Gly Thr ifu Glu Glu 



CAG GAC GGG GAC CAC TGC GCC GGT GCC GCC GTT 
Gin Asp Gly Asp His Cys Ala Gly Ala Ala Val 
130 135 140 

CGG GCC AGA GGG TGC AGA AGG GAG GCA CCG AGA 
Arg Ala Arg Gly Cys Arg Arg Glu Ala Pro Arg 
150 155 

GTC AGA ACT GCC CCC GGG GAC CTT CTC TCC AAT 
Val Arg Thr Ala Pro Gly Asp Leu Leu Ser Asn 
165 170 

TGT CAG CAC CAG ACC AAG TGC AGC TGG CTG GTG 
Cys Gin His Gin Thr Lys Cys Ser Trp Leu Val 
180 185 

GGG ACC AGC AGC TCC CAC TGG GTA TGG TGG TTT 
Gly Thr Ser Ser Ser His Trp Val Trp Trp Phe 
195 200 

GTC ATC GTC ATT GTT TGC TCC ACA GTT GGC CTA 
Val He Val He Val Cys Ser Thr Val Gly Leu 
210 215 220 

AGA AGA AAG CCA AGG GGT GAT GTA GTC AAG GTG ATC GTC TCC GTC CAG 



ACG AAG GCC 
Thr Lys j 
190 



6CT 
ily Ala 



,GGA 



CTC TCA GGG AG , 
Leu Ser Gly Ser 
205 

ATC ATA TGT GTG ^ 
He He Cys Val Lj 



60 
120 
180 
240 
295 

343 

391 

439 

487 

535 

583 

631 

679 

727 

775 

823 

871 

919 

967 

1015 
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Arg Arg Lys Pro Arg Gly Asp Val Val Lys Val 
230 235 

CGG AAA AGA GAG GAG GCA GAA GGT GAG GOG ACA 
Arg Lys Ara Gin Glu Ala Glu Gly Glu Ala Thr 

\24S 250 

GAG GOG OCT feCG GAG GTC ACC ACG GTG GCC GTG 
Gin Ala Pro Fro Asp Val Thr Thr Val Ala Val 
260 \ 265 

CAT TCA CGG GGAXGGA GCC CAA ACC ACT-GAC CCA 
His Ser Arg Gly Gly Ala Gin Thr Thr Asp Pro 
275 \ 280 



He Val Ser Val Gin 
240 

GTC ATT GAG GCC GTG 
Val He Glu Ala Leu 
255 

AGG AGA CAA TAG CCT 
Arg Arg Gin Tyr Pro 

GAG ACT GTG CAC CCC 
Gin Thr Leu His Pro 
285 



GAG GCC AGA GAT AdC TGG AGC GAG GGC TGC TGA AAGAGGCTGT CCACCTGGCG 
Asp Ala Arg Asp Thi\Trp Ser Asp Gly Cys * 
290 \295 300 

AAACCACCGG AGCCCGGAGQ^ CTTGGGGGCT CCGCCCTGGG CTGGCTTCCG TCTCCTCCAG 
TGGAGGGAGA GGTGGGGCCC tTGCTGGGGT AGAGCTGGGG ACGCCACGTG CCATTCCCAT 
GGGCCAGTGA GGGCCTGGGG c\tCTGTTCT GCTGTGGCCT GAGCTCCCCA GAGTCCTGAG 
GAGGAGCGCC AGTTGCCCCT CGCtCACAGA CCACACACCC AGCCCTCCTG GGCCAGCCCA 
GAGGGCCCTT CAGACCCCAG CTG7t?TGCGC GTCTGACTCT TGTGGCCTCA GCAGGACAGG 
CCCCGGGCAC TGCCTCACAG CCAAG^^CTGG ACTGGGTTGG CTGCAGTGTG GTGTTTAGTG 
GATACCACAT CGGAAGTGAT TTTCTAAATT GGATTTGAAT TCCGGTCCTG TCTTCTATTT 
GTCATGAAAC AGTGTATTTG GGGAGATGQT GTG6GAGGAT GTAAATATCT TGTTTCTCCT 

CAAAAAAAAA aaaaaaaaaa aaaaaaa 



1063 

1111 

1159 

1212 

1272 
1332 
1392 
1452 
1512 
1572 
1632 
1692 
1719 



(2) INFORMATION FOR SEQ ID N0:2: 

\ 

(i) SEQUENCE CHARACTERISTICSX 

(A) LENGTH: 299 amino ^ids 



SB) TYPE: amino acid 
D) TOPOLOGY: linear 



\ 



(ii) MOLECULE TYPE: protein \ 

(xi) SEQUENCE DESCRIPTION: SEQ ID N^:2: 

Met Glu Pro Pro Gly Asp Trp Gly Pro Pro Pro\Trp Arg Ser 
1 5 10 \ 

Arg Thr Asp Val Leu Arg Leu Val Leu Tyr Leu jk\ 
20 25 

Pro Cys Tyr Ala Pro Ala Leu Pro Ser Cys Lys Glu 
35 40 

Val Gly Ser Glu Cys Cys Pro Thr Cys Ser Pro Gly 
50 55 60 

Glu Ala Cys Gly Glu Leu Thr Gly Thr Val Cys Glu 
65 70 76 

Gly Thr Tyr He Ala His Leu Asn Gly Leu Ser Lys 
85 90 

Gin Met Cys Asp Pro Ala Met Gly Leu Arg Ala Thr 
100 105 



Thr Pro 
15 




Arg Asn 
110 



s Ser 
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Arg Thr Glu Asn Ala Val Cys Gly Cys Ser Pro 
115 120 

Val Gin Asp Gly Asp His Cys Ala Gly Ala Ala 
130 "I^S 

Ala Arg Ala Arg Gly Cys Arg Arg Glu Ala Pro 
145 ^ "ISO 

Cys Val Arg Th\Ala Pro Gly Asp Leu Leu Ser 



Glu Cys Gin His 
180 



Thr Lys Cys Ser Trp Leu 
185 



Gly His 
125 

Val Thr 
140 

Arg Val 
Asn Gly 
Val Thr 



Ala Gly Thr Ser SeAser His Trp Val Trp Trp 
195 \ 200 

Leu Val He Val He v\l Cys Ser Thr Val Gly 
210 \ 215 

Lys Arg Arg Lys Pro Arg\ly Asp Val Val Lys 

225 230 \ 23& 

Gin Arg Lys Arg Gin Glu Afa Glu Gly Glu Ala 
245 \ 

Leu Gin Ala Pro Pro Asp Val\hr Thr Val Ala 
260 \ 265 

Pro His Ser Arg Gly Gly Ala Gl\ Thr Thr Asp 
275 280\ 

Pro Asp Ala Arg Asp Thr Trp Ser Asp Gly Cys 
290 295 \ 

(2) INFORMATION FOR SEQ ID N0:3: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base paxrs 

(B) TYPE: nucleic acid 
)c> STRANOEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3> 

AACCCGGCTC GAGCGGCCGC T 

(2) INFORMATION FOR SEQ ID N0:4: 

(i> SEQUENCE CHARACTERISTICS: 



Phe Leu 

205 

Leu He 
220 



Phe Cys He 

Pro Pro Pro 

Arg Thr Pro 
160 

Thr Leu Glu 
175 

Lys Ala Gly 
190 

Ser Gly Ser 
He Cys Val 



Val He 
Thr Val 
Val Arg 

Pro Gin 

285 



Val Ser Val 
240 

He Glu Ala 

255 

Arg Gin Tyr 
270 

Thr Leu His 



LENGTH: 22 base pairs 
TYPE: nucleic acid 
STRANOEDNESS: single 
TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
GAATTCCACC ACACTTAAGG TG 

(2) INFORMATION FOR SEQ ID NO: 5: 
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fi) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: OUA 
(xi) SEobfNCE DESCRIPTION: SEQ ID N0:5: 
ACAAGACCGT TGG^CCCTC 

(2) information vor seq id n0:6: 
(1) sequenceXcharacteristics: 



LENCaTH: 4619 base pairs 
TYPeK nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 



(11) MOLECULE TYF^: cDNA 

(ix) FEATURE: 

(A) NAME/KEY:\CDS 

(B) LOCATION: 1^4,. 1320 



(ix) FEATURE: 
^ (A) NAME/KEY: mat 

(B) LOCATION: 64. 



peptide 
ri317 



SEQ ID N0:6: 



(xi) SEQUENCE DESCRIPTIOh 
AA6CTTGCAT GCCT6CAGGT CGACTCTaIsC TGGGTTCCCG AGCTGCCGGT CTGAGCCTGA 



6GC ATG GAG OCT OCT GGA GAC TGG'^G 
y Met Glu Pro Pro Gly Asp Trp aj.y 



CCC AGA ACC GAC GTC TTG AGG CTG GTi 
Pro Arg Thr Asp Val Leu Arg Leu Val 
20 

6CC CCC TGC TAC GCC CCA 6CT CTG CCG T 
Ala Pro Cys Tyr Ala Pro Ala Leu Pro Se 
3' 
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CCT CCT CCC TGG AGA TCC ACC 
Pro Pro Pro Trp Arg Ser Thr 
10 15 

,CTG TAT CTC ACC TTC CTG GGA 
eu Tyr Leu Thr Phe Leu Gly 

TGC AAG GAG GAC GAG TAC 
^Cys Lys Glu Asp Glu Tyr 
45 



CCA GTG 6GC TCC GAG TGC TGC CCC ACG TGC ^T CCA GGT TAT CGT GTG 
Pro Val Gly Ser Glu Cys Cys Pro Thr Cys s5r Pro Gly Tyr Arg Val 
50 55 \ 



AAG GAG GCC TGC GGG GAG CTG ACG GGC ACA GTG 
Lys Glu Ala Cys Gly Glu Leu Thr Gly Thr Val 
65 70 

CCA GGC ACC TAC ATT GCC CAC CTC AAT GGC CTA 
Pro Gly Thr Tyr He Ala His Leu Asn Gly Leu 

80' 85 9U 

TGC CAA ATG TGT GAC CCA GCC ATG GGC CTG CGC 
Cys Gin Met Cys Asp Pro Ala Met Gly Leu Arg 
' 100 105 



TGT GAA CCC TGC CCT 
^s Glu Pro Cys Pro 

AGC\AAG TGT CTG GAG 
Ser Vys Cys Leu Gin 
95 

6CG AC^ CGG AAC TGC 
Ala Thi\Arg Asn Cys 
^ 110 



rr^ AGG ACA GAG AAC GCC GTG TGT GGC TGC AGC CCA GGC CXC TTC TGC 
S^ A% TM^ GlS A^n Al^: Val Cys Glv Cys Ser Pro Gly Hg Phe Cys 
115 120 

ATC GTC CAG GAC GGG GAC CAC TGC GCC GGT GCC GCC GTT ACG \CA CCT 
lie Val Gin Asp Gly Asp His Cys Ala Gly Ala Ala Val Thr Pro Pro 



60 
108 

156 

204 

252 

300 

348 

396 



492 



39 



130 



135 



140 



S5 ss ss s?s ^ x?s s eis s 5^ 5Ii ^ tSi 

145 

SI SS il?5 S S^S §S SI flS if. i^I IS « Eg 

160 ,165 



GAG 6AA TGT 
Glu Glu Cys 



Gln> 



CAC CAG ACC AAG TGC AGA ATT CAC AAG ACC GTT GCA 
His Gin Thr Lys Cys Arg lie His Lys inr vax 
Si 80 

AAG CCC ACG TGC CCA CCC CCT GAA CTC CTG GGG 
Lys Pro Thr Cys Pro Pro Pro Glu Leu Leu uxy 
200 



tie 



Gil 



r 



CCC TCG ACA TGC 
Pro Ser Thr Cys 
195 

GGA CCG TCT GTC TTC' 
Glv Pro Ser Val Phe 
210 

ATC TCA CGC ACC CCC 
He Ser Arg Thr Pro 
225 

GAT GAC CCC GAG GTG CAG 
Asp Asp Pro Glu Val Gin 
240 24& 

CGC ACC GCC CGG CCG COG CTA 
Arg Thr Ala Arg Pro Pro Leu 

CGC GTG GTC AGC ACC CTC CCC ATC' 
Arg Val Val Ser Thr Leu Pro He 
275 

AAG GAG TTC AAG TGC AAA GTC CAC 
Lys Glu Phe Lys Cys Lys Val His 
' 290 

GAG AAA ACC ATC TCC AAA GCC AGA 
Glu Lys Thr He Ser Lys Ala Arg 
305 



Z^TC TTC CCC CCA AAA CCC AAG GAC ACC CTC ATG 
^ Phe Pro Pro Lys Pro Lys Asp Thr Leu Met 
215 220 

GTC ACA TGC GTG GTG GTG GAC GTG AGC CAG 
vll Thr cys Val Val Val Asp Val Ser Gin 
' 230 235 

ACA TGG TAC ATA AAC AAC GAG CAG GTG 
Thr Trp Tyr He Asn Asn Glu Gin Val 
250 

GAG CAG CAG TTC AAC AGC ACG ATC 
Glu Gin Gin Phe Asn Ser Thr He 
265 270 

^CG CAC CAG GAC TGG CTG AGG G6C 
thr His Gin Asp Trp Leu Arg Gly 
50 285 

AA(\AAG GCA CTC CCG GCC CCC ATC 
Asn\ys Ala Leu Pro Ala Pro He 

GGG CA^ CCC CTG GAG CCG AAG GTC 
Glv Gln\Pro Leu Glu Pro Lys Val 
^ ^ 315 

^ K? IS J?S IS SS II S J?S iS? 5S f 

320 ^25 



,C ATC TCG GTG GAG 
p He Ser Val Glu 
^ 350 



TGG GAG AAG AAC GGG AAG GCA GAG GAC AAC TAC 
Trp Glu Lys Asn Gly Lys Ala Glu Asp Asn lyr 
355 



AA<X ACC ACG CCG GCC 
LysXThr Thr Pro Ala 
^ 365 



GTG CTG GAC AGC GAC GGC TCC TAC TTC CTC TAC AAC CTC TCA GTG 

Val Leu Asp Ser Asp Gly Ser lyr rne i.«u y 
370 



Cys 



iS? SS; SS ??5 SS 115 ?S f S 

385 

as iiS SS K Tfr Tr SS tf. IS? ?K h, 

400 



_J GTG ATG 
Ser Val Met 



CGC TCT 
V Arg Ser 
415 



540 



588 



636 



684 



732 



780 



828 



876 



924 



972 



1020 



1068 



1116 



1164 



1212 



1260 



1308 
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CCG GGT AAA TGA GCGCTGTGCC GGCGAGCTGC CCCTCTCCCT CCCCCCCACG 1360 
Pro Gly Lys * 

CCGCAGCTGT GCACCCCQCaXcACAAATAAA GCACCCAGCT CTGCCCTGAA CAGCTTCCGG 1420 

TCTCCCTATA GTGAGTCGTA VrAATTTCGA TAAGCCAGCT GCATTAATGA ATCGGCCAAC 1480 

GCGCGGGGAG AGGCGGTTTG COTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC 1540 

TGCGCTCGGT CGTTCGGCTG CGGtGAGCGG TATCAGCTCA GTCAAAGGCG GTAATACGGT 1600 

TATCCACAGA ATCAGGGGAT AACGQAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG 1660 

CCAGGAACCG TAAAAAGGCC GCGTTQCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 1720 

AGGATCACAA AAATCGACGC TCAAGTCAGA GGT6GCGAAA CCCGACAGGA CTATAAAGAT 1780 

ACCAGGCGTT TCCCCCTGGA AGCTCCCTfcG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA 1840 

CCGGATACCT GTGCGCCTTT CTCCCTTCG^ GAAGCGTGGC GCTTTCTCAT AGGTCACGCT 1900 

GTAGGTATCT CAGTTCGGTG TAGGTCGTTcXgCTCCAAGCT GGGCTGTGTG CACGAACCCC 1960 

CCGTTCAGCC CGACCGCTGC GCCTTATCCG OTAACTATCG TCTTGAGTCC AACCCGGTAA 2020 

GACACGACTT ATCGCCACTG GCAGCAGCCA CTOGTAACAG GATTAGCAGA GCGAGGTATG 2080 

TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGAGAG 2140 

TATTTGGTAT CTGCGCTCTG CTGAA6CCAG TTaActTCGG AAAAAGAGTT GGTAGCTCTT 2200 

GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA 2260 

CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGAJCTT TTCTACGGGG TCTGACGCTC 2320 

AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATOAG ATTATCAAAA AGGATCTTCA 2380 

CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAaT^AAAGTATA TATGAGTAAA 2440 

CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCAGC^TATCTCAGCG ATCTGTCTAT 2500 

TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT /^CTACGATA CGGGAGGGCT 2560 

D TACCATCTGG CCCCAGT6CT GCAATGATAC CGCGAGACCc\aCGCTCACCG GCTCCAGATT 2620 

TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AaGTGGTCCT GCAACTTTAT 2680 

CCGGCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 2740 

ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 2800 

GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTSACATGA TCCCCCATGT 2860 

TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATOiT TGTC;^GAAGT AAGTTGGCCG 2920 

CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG 2980 

TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC 3040 

GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA 3100 

CTTTAAAAGT GCTCATGATT GGAAAACGTT CTTCGGGGCG AAAACTCT(yi AGGATCTTAC 3160 

CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATGtVtCAGCATCTT 3220 

TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 6CAAAAAAGG 3280 

GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA 3340 

41 



GCATTTATCA GGGTTArTGT CTCATGAGCG GATACATATT 
AACAAATAGG GGTTCCOCGC ACATTTCCCC GAAAAGTGCG 
TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC 
GTTTCGGT6A TGACGGTGAA AACCTCTGAC ACAtGCAGCT 
GTCTGTAAGC 6GATGCCGG($ AGCAGACAAG CCCGTCAGGG 
GQTQTCGGGG GTGGCTTAAC\TATGCGGCAT CAGAGCAGAT 
TCGACGCTCT CCCTTATGCG ACTCCTGCAT TAGGAAGCAG 
GTTGAGCACC GCCGCGGCAA gWtGGTGC AAGGAGATGG 
ACGGGGCCT6 CCACCATAGC CACpCCGAAA CAA6CGCTCA 
CGATCTTCCC CATCGGTGAT GTCCSpCGATA TAGGCGCCAG 
GTGATGCCGG CCACGATGCG TCCGGCGTAG AGGATCTGGO 
TTACGGGGTC ATTAGTTCAT AGGCCAtATA TGGAGTTCCG 
ATGGCCCGCC TGGCTGACCG OCCAACGACC CCCGCCCATT 
TTCCCATAGT AACGCCAATA GGGACTTT<XC ATTGACGTCA 
AAACTGCCCA CTTGGCAGTA CATGAAGTGtV ATCATATGCC 
TCA^TGACGG TAAATGGCCC GCCTGGCATT VtGCCCAGTA 
CTACTTGGCA GTACATCTAC GTATTAGTCA TtJGCTATTAC 
AGTACATCAA TGGGCGTGGA TAGCGGTTTG AC^ 
TTGACGTCAA TGGGAGTTTG TTTTGGCACC AAA)^TCAACG 
ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGfeCGTGT 
GCAGAGCTCT CTGGCTAACT AGAGAACCCA CTGcAaACT 
ACTCACTATA GGGAGACCC 

(2) INFORMATION FOR SEQ ID N0:7: 



TGAATGTATT 
ACCTGACGTC 
GAGGCCCTTT 
CCCGGAGACG 
CGCGTCAGCG 
TGTACTGAGA 
CCCAGTAGTA 
CGCCCAACAG 
TGAGCCCGAA 
CAACCGCACC 
TAGTTATTAA 
CGTTACATAA 
GACGTCAATA 
ATGGGTGGAC 
AAGTACGCCC 
CATGAGCTTA 
CATGGTGATG 
ATTTCCAAGT 
GGACTTTCCA 
ACGGTGGGAG 
GGCTTATCGA 



TAGAAAAATA 
TAAGAAACCA 
CGTCTCGCGC 
GTCACAGCTT 
GGTGTTGGCG 
GTGCACCATA 
GGTTGAGGCC 
TCCCCCGGCC 
GTGGCGAGCC 
TGTGGCGCC6 
TAGTAATCAA 
CTTACGGTAA 
ATGACGTATG 
TATTTACGGT 
CCTATTGACG 
TGGGAGTTTG 
CGGTTTTGGC 
CTCCACCCCA 
AAATGTCGTA 
GTCTATATAA 
AATTAATACG 



(1) SEQUENCE CHARACTERISTIOS: 

(A) LENGTH: 418 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



(11) MOLECULE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:; 

Met Glu Pro Pro Gly Asp Trp Gly Pro Pro Pro 
1 5 10 

Arg Thr Asp Val Leu Arg Leu Val Leu Tyr Leu 
20 25 

Pro Cys Tyr Ala Pro Ala Leu Pro Ser Cys Lys 
35 40 

Val Gly Ser Glu Cys Cys Pro Thr Cys Ser Pro 
50 55 



trp Arg Ser Thr Pro 
15 

Tfcr Phe Leu Gly Ala 
30 

Gl^ Asg Glu Tyr Pro 

Gljl Tyr Arg Val Lys 
60\ 



3400 
3460 
3520 
3580 
3640 
3700 
3760 
3820 
38^0 
3940 
4000 
4060 
4120 
4180 
4240 
4300 
4360 
4420 
4480 
4540 
4600 
4619 



Glu Ala Cys Gly Glu Leu Thr Gly Thr Val Cys Glu\Pro Cys Pro Pro 
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65 



70 



75 



80 



Gly Thr Tyr IleWa His Leu Asn Gly Leu Ser Lys Cys Leu Gin Cys 
^ 90 95 

Gin Met Cys Asp ffVo Ala Met Gly Leu Arg Ala Thr Arg Asn Cys Ser 
100 \ 105 , 110 

Arg Thr Glu Asn Al^Val Cys Gly Cys Ser Pro Gly His Phe Cys He 
115 \ 120 125 

Val Gin Asp Gly Asp His Cys Ala Gly Ala Ala Val Thr Pro Pro Pro 
130 \ 135 140 

Ala Arg Ala Arg Gly Cys\Arg Arg Glu Ala Pro Arg Val Arg Thr Pro 
145 150 \ 155 160 

Cys Val Arg Thr Ala Pro Gly Asp Leu Leu Ser Asn Gly Thr Leu Glu 
165 V 170 175 

Glu Cys Gin His Gin Thr LysXcys Arg He His Lys Thr Val Ala Pro 
180 \ 185 190 

Ser Thr Cys Ser Lys Pro Thr CVs Pro Pro Pro Glu Leu Leu Gly Glv 
195 200 205 

Pro Ser Val Phe He Phe Pro PrA Lys Pro Lys Asp Thr Leu Met He 
210 215 \ 220 

Ser Arg Thr Pro Glu Val Thr Cys ^1 Val Val Asp Val Ser Gin Asp 
225 230 \ 235 240 



Asp Pro Glu Val Gin Phe Thr Trp 
245 



Ty] 



He Asn Asn Glu Gin Val Arg 

l250 255 



Thr Ala Arg Pro Pro Leu Arg Glu Gin Gin Phe Asn Ser Thr He Ara 
260 265 \ 270 

Val Val Ser Thr Leu Pro He Thr His Gin Asp Trp Leu Arg Glv Lvs 
275 280 \ 285 

"-y® *-y^ ^sn Lys AlaXLeu Pro Ala Pro He Glu 

290 295 \ 000 

Lys Thr He Ser Lys Ala Arg Gly Gin Pro Ubu Glu Pro Lys Val Tvr 
305 310 3i5 320 

Thr Met Gly Pro Pro Arg Glu Glu Leu Ser SeX Arg Ser Val Ser Leu 
325 330 \ 335 

Thr Cys Met He Asn Gly Phe Tyr Pro Ser Asp Vie Ser Val Glu Tro 
340 345 \ 350 

Glu Lys Asn Gly Lys Ala Glu Asp Asn Tyr Lys Thr Thr Pro Ala Val 
355 360 \ 365 

Leu Asp Ser Asp Gly Ser Tyr Phe Leu Tyr Asn LyA Leu Ser Val Pro 
370 375 380 

Thr Ser Glu Trp Gin Arg Gly Asp Val Phe Thr Cys \er Val Met His 
390 395 A 400 

Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser He s\r Arg Ser Pro 
405 410 A 415 

Gly Lys 
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WHAT IS CLAIMED IS: 



1. An isolated and purified polypeptide of about 300 aminoac 
residues comprising the amino acid residue seauence SEO'lD NO:2. 




3. A proc^^^f det^ting an antibody against HVER in a 
biological sam^ coiAprising adding the polypeptide of claim 1 to the 
aaintaining the sample for a period of time sufficient for 
lation of a conjugate between the anti body and ^ e polypeptide and 
detecting the presence of the conjugate and thereby the antibody. 



4. An isolated and puriited polynucleotide comprising a 
nucleotide sequence consisting ess^tially of a nucleotide sequence 
selected from the group consisting oK\a) the sequence of SEQ ID NO:l 
from nucleotide position 293 to about nu^eotide position 1189; b) 
sequences that are complementary to the semence of (a); c) sequences 
that, on expression, encode a polypeptide encoded by the sequence of 

(a). ..„.. 



5. The polynucleotide of ^i^/^hat is a DNA molecule. 



6. The pnlynnrleotjde of claim 5 whereia-Aemntleotide 
segueiice-is-SECr^NO: 1 . 

7. The polynucleotide of cl^ that is 



an RNA molecule. 



8. An expression vector comf rising the DNA molecule of claim 



9. The expression vector of 
enhancer-promoter operative^ linked 



cliim 8 further comprising an 
the polynucleotide. 
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10. The expression vector of claim 8 wherein^e^ 
has the nucleotide sequenceofSBaro^^<3^^ position 
293 to about nudeotiaSip^ition 1189. 



11. An oligonucl^ 
containing a nucleotide | 
identical or compleme 
polynucleotide of cl? 



Je of from about 15 to aboul^Se-liuSeotides 
uce of at l^aeflSmicleotides that is 
Dntiguous sequence of the 



tC The oUgonucleotide of claim 11 that is an antisense 
-olig nT^ilcleotide^ 



cell. 



13. A host cell transfd|rmed with the expression vector of claim, 8. 
14f The transformed hdst ceU of claim .13 that is a mammalian 

15. * The transformed host cell of claim 13^ that is a bacterial ceU. 



16. The transformed hostjftell of clahn 14 wherein the mammalian 
cell is an ovarian cell. 

17. ' The transformed host ciU of claim 16 wherein the ovarian cell 
is designated CHO-A3, CHO-All, VtHO-BS, CHO-B9, or CHO-Bll. 

18. A process of making HVER comprising transforming a host 
cell with the expression vector of clailb 8, maintaining the transformed 
cell for a period of time sufficient forlpxpression of the HVER and 
recovering the HVER. 

Ml \ 

19. The process of claim 18 whel^ein the host ceU is an eukaiyotic 
host cell. 
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1^ 

*^0C The process of claiintt9 wherein the host^cell is a manunauan 



ceU; 



.21.' The process of claimio wherein the mammalian cell is an 
ovarian cell. 

22. The process of claim IB wherein the HVER is human HVER. 



23, The process of claim 18 whereinjbe-po^ncleotide has the 
nucleotide seqiience^^SEQ-ffl^W5ll from nucleotide position 293 to 
about-OTt^eoSde position 1189. 



24. ^Jgyggjiiade by the process ^ot^aim-^ 



25. A pharmaceu 
of claim 12 and a physiold] 




26. A<|$Mrmaceutical composition comgrjsing the polypeptide of 
^gjamy^t-^rt a physaologtcaBy^acceptable dfluent 

27. A plasmid selected frdni the group consisting of pBEClO, 
pBEC580, and pBL58. 
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Abstract of the Disclosure 



6 




The present invention provides isolated and purified polynucleotides 
that encod ^I rVCR of mammalian origin^ expression vectors containing 
those polynucleotides, host cells transformed with those CT^ression 
vectors, a process of making HVER using those polynucleotides and 

vectors, and isolated and purified TT 

A, 
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1 CCTTCATACC C TTCCC CTCGGCTTTG CCTGGACAGC TC TCCC GCAGGGCCCA 

61 CCTGTGTCCC C^__JGCCGC TCCACCCAGC AGGCCTGAGC CCt fCTGC TGqCAGACAC 

,2, ^STg^^^"^ CCACTCTCCT GCTGCTCGGG TTCTGAGGCA CAGCTTGTCA CACCGAGGCG 

181 GATTCTCTTT CTCTTTCTCT TCTGGCCCAC AGCCGCAGCA ATGGCGCTGA GTTCCTCTGC 

241 TGGAGTTCftT CCTGCTAGCT GGGTTCCCGA GCTGCCGGTC TGAGCCTGAG GCATGGAGCC 

^ M E p 

301 TCCTGGAGAC TGGGGGCCTC CTCCCTGGAG ATCCACCCCC AGAACCGACG TCTTGAGGCT 

^ ..™„.®..„?..„„*....^ ^ PPWR STP RTD VLRL 

361 GGTGCTGTAT CTCACCTTCC TGGGAGCCCC CTGCTACGCC CCAGCTCTGC 

24 .V„..jk..„r. Ji.„X..,l Jk..,J?.. AP CYA PAvL P 





GGGGACCTTC TCTCCAATGG GACCCTGGAG 
GDL LS MG TLB 



841 
184 

901 
204 

961 
224 

1021 
244 

1081 
264 

1141 
284 

1201 
i261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 



GTGCAGCTGG 
C S W 

GTTTCTCTCA 
F L S 



CTGGTGACGA AGGCCGGAGC TGGGACCAGC 
LVT KAGA GTS 

GGGAGCCTCG TCATCGTCAT TGTTTGCTCC 
GSL VI. VI VCS 



GAATGTCAGC ACCAGACCAA 
E C Q H Q T K 

AGCTCCCACT GGGTATGGTG 
S S H W V W W 

ACAGTTGGCC TAATCATATG 
T V G I, I I C 



TGTGAAAAGA AGAAAGCCAA GGGGTGATGT AGTCAAGGTG 
JL KR RKP RGDV VKV 

AA.GACAGGAG GCAGAAGGTG AGGCCACAGT CATTGAGGCC 
R QE AB G EATV IBA 

CACCACGGTG GCC6TGAGGA GACAATACCC TCATTCACGG 
TTV AVR RQYP HSR 

CCCACAGACT CTGCACCCCG ACGCCAGAGA TACCTGGAGC 
PQ T LH P DARO TWS 

GTCCACCTGG CGAAACCACC GGAGCCCGGA GGCTTGGGGG 
CGTCTCCTCC AGTGGAGGGA GAGGTGGGGC CCCTGCTGGG 
TGCCATTCCC ATGGGCCAGT GAGGGCCTGG GGCCTCTGTT 
CAGAGTCCTG AGGAGGAGC6 CCAGTTGCCC CTCGCTCACA 
TGGGCCAGCC CAGAGGGCCC TTCAGACCCC AGCTGTCTGC 
CAGC AGGA CA GGCCCCGGGC ACTGCCTCAC AGCCAAGGCT 
TGGTCTTTAG TGGA3ACCAC ATCGGAAGTG ATTTTCTAAA 
TGT CTTC TAT TTGTCATGAA ACAGTGTATT TGGGGAGATG 
CTTGTTTCTC CTCAAAAAAA AAAAAAAAAA AAAAAAAAA 



ATCGTCTCCG TCCAGCGGAA 
I V S V Q R K 

CTGCAGGCCC CTCCGGACGT 
I. O A P P D V 

GGAGGAGCCC AAACCACTGA 
G G A Q T T D 

GACGGCTGCT GAAAGAG6CT 
D 6 C 

CTCCGCCCTG GGCTGGCTtC 
GTAGAGCTGG GGACGCCACG 
CTGCTGTGGC CTGAGCTCCC 
GACCACACAC CCAGCCCTCC 
GCGTCTGACT CTTGTGGCCT 
GGACTGGGTT GGCTGCAGTG 
TTGGATTTGA ATTCCGGTCC 
CTGTGGGAGG ATGTAAATAT 



BstXI. 




TtM 




PFU/well 




CO 
CO 

CO 

CM 

X 



i 



(|iu/ndd) P|9!A snj|A 




/ 



PRINT or UKAW1M05 
AS ORJCINAIXY FELXD 
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1 AAGCTTGCAT 
61 GGCATGGAGC 
1 HE 



GC ZAGGrr CGACTCTAGC TGGGTTCCCG AC 
C" GGAGA CTGGGGGCCT CCTCCCTGGA G? ' ' 
1 GDtfGP PPWR 



CGGT CTGAGCCTGA 
.CCCC CAGAACCGAC 
TP R T D 



121 GTCTTGAGGC 
20 V L R 



TGGTGCTGTA TCTCACCTTC CTGGGAGCCC CCTGCTACGC CCCAGCTCTG 
Jf...,y.....h.. y LTF LGA PCYA PAL 



181 CCGTCCTGCA 
40 P 




301 CCAGGCT^^ ACATTGCCCA CCTCAATGGC CTAAGCAAGT GTCTGGAGTG CCAAATGTGT 

60 ' - 



361 
100 




»GCCGCC I 5^ " 



541 CCCTGTGTCA 
160 

601 CACCAGACCA 
180 H Q T 

661 TGCCCACCCC 

200 C P P 

721 GACACCCtCA 

220 D T L 

781 GATGACCCCG 

240 D b P 

841 CCGCCGCTAC 

260 P P L 

901 ACGCACCAGG 

280 T H Q 

961 CCGGCCCCCA 

300 PAP 

1021 TACACCATGG 

320 y T M 

1081 ATCAACGGCT 

340 I N G 

1141 . GACAACTACA 

360 D H Y 

1201 AAGCTCTCAG 

380 K L 5 

1261 CACGAGGCCT 

400 H e A 



CTGAACTCCT 
PELL 

TGATCTCACG 
H I S R 

AGGTGCAGTT 
B V Q F 

GGGAGCAGCA 
R B Q Q 

ACTGGCTGAG 
D W L R 

TCGAGAAAAC 
I B K T 

GCCCTCCCCG 
G P P R 

TCTACCCTTC 
F Y P S 

AGACCACGCC 
K T T P 

TGCCCACGAG 
V P T S 

TGCACAACCA 
L H N H 



GGGGGGACCG TCTGTCTTCA 
G G P V F 

/ 

CACCCCCGAG yfexCACATGCG 
T P B/ V T C 

CACATGGTAC ATAAACAACG 
T W y I N M 



6TTCAAGAGC ACGATCCGC6 
F N 3 T I R 

GGGCAAGGAG TTCAAGTGCA 
G K E F K C 

CATCTCCAAA GCCAGAGGGC 
I S K A R G 

GGAGGAGCT6 AGCAGCAGGT 
BEL S 3 R 

CGACATCTCG GTGGAGTGG6 
D Z S V B W 

GGCCGTGCTG GACAGCGACG 
A V L DSD 

TGA6TGGCAG CGGGGCGACG 
B W Q R G D 

CTACACGGAG AAGTCCATCT 
. Y T Q K S I 



C6ACATGCAG 
5 T C S 

TCTTCCCCCC 
I F P P 

TGGTGGTGGA 

V V V D 

AGCAGGTGCG 
E Q V R 

TGGTCAGCAC 

V V S T 

AAGTCCACAA 
K V H N 

AGCCCCTGGA 
Q P L B 

CGGTCAGCCT 
S V S L 

AGAAGAACG6 
B K N G 

GCTCCTACTT 
G S Y F 

TCTTCACCTG 

V F T C 

CCCGCTCTCC 
3 R 3 P 



GGAATGTCAG 
B C Q 

CAAGCCCACG 
KPT 

AAAACCCAAG 
K P K 

CGTGAGCCAG 
V S Q 

CACCGCCCGG 
TAR 

CCTCCCCATC 
L P I 

CAAGGCACTC 
K A L 

GCCGAAGGTC 
P K V 

GACCTGCATG 
T C M 

GAAG6CAGA6 
K A B 

CCTCTACAAC 
L Y N 

CTCCGTGATG 
3 V M 

GGGTAAATGA 
G K - 
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1321 GCGCTGTGCC GGk TTGC CCCTCTCCCT CCCCCCCACG CC& TGT GCACCCCGCA 

1381 CACAAATAAA GCAw«iiAGCT CTGCCCTGAA CAGCTTCCG6 TCTC^ ,rATA GTGAGTCGTA 

1441 TTAATTTCGA TAAGCCA6CT GCATTAATGA ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG 

1501 CGTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC TGCGCTCGGT CGTTCGGCTG 

1561 CGGCGAGCG6 TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA ATCAGGGGAT 

1621 AACGCAGGAA AGAACATGTG AGCAAAA6GC CASCAAAAGG CCAGGAACCG TAAAAAGGCC 

1681 GCGTTGCTGG CGTTTTTCCA TAG^GCTCCGC CCCCCTGACG AGCATCACAA AAATCGACGC 

1741 TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT ACCAGGC6TT TCCCCCTGGA 

1801 AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA CCGGATACCT 6TCCGCCTTT 

1861 CTCCCrrCGG GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT CAGTTCGGTG 

1921 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC CGACCGCTGC 

1981 GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT ATCGCCACTG 

2041 GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC TACAGAGTTC 

2101 TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG 

2161 CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCZAA ACAAACCACC 

2221 GCTGGTAGCG GTGGTTrrTT T6TTTGCAA6 GAGGAGATTA CGCGCAGAAA AAAAGGATCT 

2281 CAAGAAGATC CTTTGATCTT TTCTAC6GGG TCTGACGCTC AGTGGAACGA AAACTCACGT 

2341 TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT TTTAAATTAA 

2401 AAATGAAGTT TTAAATCAAr CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA 

2461 TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 

2521 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG CCCCAGTGCT 

2581 . GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAGCAAT AAACCAGCCA 

2641 GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT CCGCCTCCAT CCAGTCTATT 

2701 AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG CAAC6TTGTT 

2761 GCCATTGCTA CAGGCATC6T GGTGTCACGC TCGTCGTTTG GTATGGCTtC ATTCAGCTCC 

2821 GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA AGCGGTTAGC 

2881 TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG CAGTGTTATC ACTCATGGTT 

2941 ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG TAAGATGCTT TTCTGTGACT 

3001 GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGCTCTTGC 

3061 CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAA6T GCTCATCATT 

3121 GGAAAACiSTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG ATCCAGTTCG 

3181 ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC CAGCGTTTCT 

3241 GGGTGAGCAA AAACAGGAA6 GCAAAATGCC GCAAAAAAGG GAATAAGGGC GACACGGAAA 

3301 TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATT6AA GCATTTATCA GGGTTATTGT 

3361 CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC 

3421 ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA TTATTATCAT GACATTAACC 

3481 TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCTCGCGC GTTTGGGTGA TGACGGTGAA 

3541 AACCTCTGAC ACATGCAGCT CCCGGAGACG GTCACAGCTT GTCTGTAAGC GGATGCCGGG 

3601 AGGAGACAAG CCCGTCA6GG CGCGTCAGCG GGTGTTGGCG GGTGTCGGGG CTGGCTTAAC 

3661 TATGCGGCAT CAGAGCAGAT TGTACTGAGA GTGCACCATA TCGACGCTCT CCCTTATGCG 

3721 ACTCCTGCAT TAGGAAGCAG CCCAGTAGTA GGTTGAGGCC GTTGAGCACC GCCGCCGCAA 

3781 GGAATGGTGC AAGGAGATGG CGCCCAACAG TCCCCCGGCC ACGGGGCCTG CCACCATACC 

3841 CACGCCGAAA CAAGC6CTCA TGAGCCCGAA GTGGCGAGCC CGATCTTCCC CATCGGTGAT 

3901 GTCGGCGATA TAGGCGCCAG CAACCGCACC TGTGGCGCCG GTGATGCCGG CCACGATGCG 

3961 TCCGGCGTAG AGGATCTGGC TAGTTAXTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT 

4021 AGCCCATATA TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 

4081 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 

4141 GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA 

4201 CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC 

4261 GCCTGGCATT ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC 

4321 GTATTAGTCA TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 

4381 TAGCGGTTTG ACTCACGGGG ATTTCGAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 

4441 TTTTGGCACC AAAATCAACG GGACTTTCCA AAAXGTCGTA ACAACTCCGC CCCATTGACG 

4501 CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT 

4561 AGAGAACCCA CTGCTTAACT GGCTTATCGA AATTAATACG ACTCACTATA GGGAGACCC 
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